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Abstract 

Modelling  With  Integer  Variables 
Capt.  James  K.  Lowe,  USAF 
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\  I  169  pages 

Representing  nonlinear  optimization  problems  as  mixed-integer 
programs  has  largely  been  considered  as;  1)  an  "art”  with  few  unresolved 
theoretical  issues,  and  2)  a  fairly  standard  ^preprocessing”  routine  when 
combined  with  several  ad  hoc  modelling  improvements  which  have  evolved 
through  computational  experience.  The  more  common  avenue  of  research  in 
mixed-integer  programming  has  focused  upon  finding  improved  algorithms 
and  heuristics  to  solve  the  problems/  assuming  a  standard  mixed-integer 
representation  exists.  In  Jhis  thesis,  we  take^^a  step  backwards  and 
re-examine5^the  theoretical  issues  and  subtleties  involved  in  representing 

problems  with  mixed-integer  representations.  _ . _ - 

As  a  result  of  our  theoretical  investigation,  necessary  and 
sufficient  conditions  are  given  which  guarantee  the  existence  of  a  mixed 
integer  representation  for  a  given  problem.  The  conditions  explain  some 
of  the  subtleties  surrounding  well-known  problems  such  as  the  fixed- 
charge  problem,  which  requires  an  upper  bound  on  the  variable.  Our 
results,  always  in  the  rational  field,  concern  the  representability  of  a 
finite  union  of  sets.  Many  results  extend  Rockafellar 's  finite  basis 
theorems . 


In  our  theoretical  investigation,  we  employ  a  system  of 
inequalities  derived  from  disjunctive  programming  to  help  prove  the 
existence  of  a  mixed-integer  representation  for  a  finite  union  of 
polyhedra  whose  recession  directions  satisfy  certain  criteria.  Uhile  the 
system  of  inequalities  is  not  new,  its  use  as  a  mixed-integer  program 
representation  is  new.  Another  mixed-integer  representation  is  presented 
which  utilizes  the  "extreme  points"  of  the  individual  polyhedra  (when 
representing  a  finite  union  of  polyhedra).  Both  representations  are 
automatic,  easy  to  use,  and  both  have  the  property  of  sharpness. 

Sharpness  concerns  the  amount  of  information  lost  as  the  integer 
variables  are  relaxed  to  the  continuous  rational  field. 

Whether  a  particular  representation  is  sharp  or  not  often 
determines  whether  a  problem  is  solvable  within  minutes  of  computation  or 
remains  unsolved  after  hours  of  computation.  Our  automatic  representa¬ 
tions  are  always  sharp,  they  often  preserve  their  sharpness  after 
variable  arbitration  (as  in  a  branch-and-bound  setting),  and  have  shown 
tremendous  efficiency  and  time  savings  for  the  problems  and  modellings 
explored. 
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INTRODUCTION 

A  widely  accepted  view  in  integer  programming  research  is  that 
progre.es,  in  utilizing  (mixed-)  integer  formulations  to  solve  real-world 
problems,  will  rely  primarily  on  algorithm  advances  (for  general  and 
apecial  structures)  and  clever  coding  tricks.  In  this  view,  we  are 
perceived  as  already  knowing  how  to  represent  a  real-world  problem  with 
integer  variables;  those  simple  “formulation  techniques"  appear  early  in 
the  subject  and  by  now  are  widespread  in  the  masters '-level  and  even 
undergraduate-level  textbooks,  (see  e.g.  (14],  (42])  However,  since  in 
the  1970's,  the  experience  of  practitioners  indicate  that  some  major 
issues  of  formulation  have  been  overlooked.  For  example,  Geoffrion  and 
Graves  (16]  solve  a  large  scale  multicommodity  distribution  problem  which 
includes  fixed  charges.  In  their  modelling,  they  notice  that  to 
economize  on  the  number  of  constraints,  a  standard  linear  programming 
technique,  results  in  more  iterations  for  convergence  to  the  optimal 
solution.  In  another  example,  Villiams  (41)  finds  a  similar  advantage  of 
disaggregating  constraints  for  logical  system  problems,  which  he  models 
as  generalized  set-covering  problems.  Both  of  the  above  references 
indicate  a  computational  need  to  reformulate  HIP  representations  before 
attempting  to  solve  them. 
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One  of  the  earliest  integer  models  is  the  Fixed-Charge  problem  in 
which  a  fixed  cost  is  incurred  (the  objective  is  to  minimise  costs)  when 
a  certain  activity  level  is  non-xero  (see  Dantzig  [10]).  This  very 
common  problem  can  be  modelled  as  a  mixed-integer  program  (MIP),  provided 
that  an  explicit  upper  bound  is  kno%m  for  the  variable  which  "triggers" 
the  fixed  cost.  A  seemingly  similar  problem  is  the  Fixed-Benefit 
problem,  in  which  a  benefit  (rebate)  is  received,  rather  than  a  cost 
incurred  when  the  variable  is  nonzero.  This  problem  cannot  be 
represented  as  a  MIP  even  with  a  known  upper  bound  for  the  "trigger" 
variable.  As  we  shall  see  in  Chapter  2,  the  Fixed-Benefit  case  requires 
an  explicit  minimum  usage  level  which  must  be  met  before  the  benefit  is 
received,  if  it  is  to  be  represented  via  (bounded)  integer  variables. 

In  contrasting  the  fixed-charge  and  fixed-benefit  problems,  we 
^•ve  illustrated  one  of  the  _subtle£ie£^  of  modelling  with  integer 
variables  that  we  will  treat  in  this  thesis.  Several  other  aspects  of 
integer  modelling  will  be  discussed,  and  we  present  here  (we  hope 
convincing)  evidence  of  our  two  theses,  namely  that;  1)  There  are 
substantial  advantages  to  using  "better"  integer  models;  and  2)  The 
study  of  integer  modelling  has  many  aspects  which  are  amenable  to  exact, 
analytic  development. 

In  the  late  sixties,  and  early  seventies,  Benichou,  et  al.  [6], 
band  and  Doig,  Beale,  and  others  implement  improved  Branch-and-Bound 
based  commercial  mixed-integer  programming  systems  capable  of  solving 
many  practical  industrial  MIPs  (AIJ.  These  algorithmic  advances  provide 
opportunities  'o  conduct  computational  experiments  to  compare  various 
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ad  hoc  modellings  and  reformulations.  The  results  of  these  experiirents , 
notably  those  of  H.  P.  Willisms,  and  Ceoffrion  and  Graves  (who  used  a 
specially-designed  enumerative  code)  motivate  others  to  re-examine  MIP 
modelling  issues  for  computational  improvements. 

Williams  (41]  finds,  in  comparing  modellings,  that  “the  superior 
formulation  is  always  the  one  which  is  “tighter"  in  a  continuous  sense." 

By  "tighter,"  he  means  the  continuous  optimum  (integrality  relaxed)  is 
closer  to  the  integer  optimum.  Williams  experiments  involve  integer 
variables  in  both  flow  problems  and  logic  problems.  His  ad  hoc  methods 
of  problem  reformulation  involve  coefficient  reduction  and  disaggregation 
of  constraints. 

Ceoffrion  and  Graves  [16]  model  a  multicommodity  distribution 
system  and  provide  a  "lesson  on  Model  Representation."  (see  Section  5  of 
1 16].)  In  their  flow  problem,  they  provide  computational  results 
favoring  a  modelling  which  disaggregates  constraints.  They  realise  that 
the  disaggregation,  in  the  relaxed  case,  provides  the  convex  hull  of  the 
original  integer  feasible  solutions.  They  also  note  that  the  price  of 
the  tighter  bound  and  reduction  in  Branch-and-Bound  tree  branching  is  the 
additional  time  required  to  solve  the  larger  LP  relaxation  at  each  node 
of  the  Branch-and-Bound  tree.  Based  upon  their  computational  and 
theoretical  results,  they  "suggest  a  general  methodology  for  discovering 
improved  model  representations:  for  various  subsets  of  contraints 
involving  some  of  the  integer  variables,  try  to  explicitly  derive  the 
convex  hull  of  the  integer  feasible  points."  Much  of  our  work  in  this 
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thesis  is  in  effect  e  partial  reply  to  their  request  for  such  a 
methodology. 

Hardin  and  Choe  [AOJ  disaggregate  relevant  constraints  of  a  fixed- 
charge  multicommodity  network  flow  problem  by  way  of  an  arc-node 
*^*P^*s*ntation  which  explictly  defines  each  possible  path  from  each 
source  i,  to  all  sinks  j.  While  improving  the  value  of  the  LP 
relaxation,  this  technique  often  greatly  increases  the  size  of  the 
resulting  MIP.  They  provide  computational  results,  supporting  the 
effectiveness  of  the  improved  LP  relaxations. 

Oley  and  Sjoquist  [38],  Crowder,  Johnson,  and  Padberg  [9], 

Johnson,  Kostreva,  and  Suhl  (28)  automate  the  process  of  coefficient 

reduction  and  constraint  disaggregation.  Oley  and  Sjoquist  implement 
their  reformulation  techniques  in  CDC's  APEX  IV  math  programming  system. 
They  employ  a  pruning  technique  similar  to  that  found  in  Chapter  V  of 
this  thesis,  and  a  "big  M"  reduction  method  similar  to  that  employed  in 
Chapter  II  of  this  thesis. 

Johnson,  et  al.  [28]  employ  similar  reductions  and  disaggregations 
in  a  large  scale  planning  scenario.  In  their  problem,  they  encounter 
fixed-charges,  either/or  type  constraints,  and  special  ordered  set  (SOS) 
variables.  They  do  not  disaggregate  the  either/or  type  constraints  as  in 
Chapter  II  of  this  thesis. 

The  reformulation  sources  mentioned  above  appeal  to  the 
"computational  effectiveness"  justification  of  their  techniques.  While 
their  results  are  extremely  useful,  they  do  not  systematically  address 
the  issue  of  existence  of  a  MIP  modelling  for  a  given  problem,  nor 
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techniques  for  providing  modellings  vith  tight  linear  relaxations,  nor 
the  "hereditary"  effects  of  various  formulations  as  branching  proceeds  in 

a  branch-and-bound  procedure. 

As  we  shall  see  later  in  this  thesis,  improvements  in  modelling 
techniques  derive  from  what  begins  as  theoretical  investigations.  In  the 
mid  seventies,  Ibaraki  (21j,  and  Meyer  [33],  [34],  I35]  launch  a 
systematic  study  of  modelling  problems  as  MIPs.  In  accounting  for  their 
earlier  results  on  the  existence  of  modellings,  our  proofs  in  Chapter  II 
will  amount  to  formulation  techniques  which  have,  and  hereditarily 
preserve,  tight  linear  relaxations.  Our  results  are  either  derived  from, 
or  motivated  by  disjunctive  programming.  In  later  chapters  of  the 
thesis,  we  report  experiments  on  the  new  formulation  techniques. 

As  in  our  approach.  Bales  {1)  also  expresses  the  discrete 
optimization  problem  as  the  intersection  of  unions  of  polyhedra.  In  that 
paper,  he  introduces  operations  to  reduce  the  number  of  intersections, 
which  strengthens  the  relaxations.  In  this  manner  one  creates  a 
hierarchy  of  relaxations  ranging  from  the  original  LP  relaxation  to  the 
exact  convex  hull  of  integer  feasible  points.  This  hierarchy  of 
relaxations  in  some  respects  goes  beyond  the  theoretical  work  reported 
here,  although  it  is  similar  to  the  lattice  of  co-propositions  in  (24] 

that  contains  several  hierarchies. 

In  Chapter  II  of  this  thesis,  we  develop  necessary  and  sufficient 
conditions  concerning  when  a  union  of  polyhedra  is  MIP  representable.  In 
doing  so,  we  develop  an  automatic  MIP  representation  tor  any  bounded-MIP 
representable  union  of  polyhedra. 
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We  then  focus  upon  the  sharpness  of  this  HIP  representation. 
Sharpness  holds  when  the  LP  relaxation  of  a  MIP  modelling  is  exactly  the 
convex  hull  of  integer  feasible  solutions.  Our  automatic  MIP 
representation  is  always  sharp.  Typically,  it  is  obtained  for  a  subpart 
of  the  whole  mixed-integer  program,  such  as  a  fixed-charge,  piecewise- 
linear  function,  either/or  constraints,  etc.  This  part  must  be  then 
"linked"  into  the  main  program,  and  typically  the  result  is  not  sharp  for 
the  whole  program.  The  issues  raised  by  Che  "linkage"  process  are  iu 
part  the  substance  of  Bales'  hierarchy  in  [1]. 

In  Chapter  III  we  review  bounded-MIP  representability  and  conduct 
two  experiments  comparing  our  "sharp"  representations  to  common 
representations  found  in  current  literature  use.  Experiments  are 
necessary  since  the  SHARP  representation  (which  subsumes  the 
disaggregation  techniques  employed  by  others)  often,  but  not  always, 
results  in  much  larger  problems  in  terms  of  both  constraints  and 
continuous  variables. 

The  first  experiment  compares  our  larger  sharp  MIP  modelling  of 
"either/or"  type  constraints  with  a  more  concise,  non-sharp  modelling 
found  in  many  current  articles  and  textbooks  ([14]  for  instance).  The 
dramatic  lack  of  sharpness  of  the  common  modelling  proves  to  be  its  major 
defect,  as  the  sharp  modelling  performs  faster  in  terms  of  cpu  seconds 
and  more  efficiently  in  terms  of  Branch-and-Bound  nodes  solved  (many 
instances  show  an  improvement  of  over  100%,  increasing  with  problem 
size).  From  our  results,  only  in  very  small  instances  did  the  "non- 
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sharp”  nodel  perform  faster  than  our  "sharp"  representation.  As  others 
have  found  using  ad  hoc  techniques,  we  find  "sharpness"  as  a  key 
ingredient  to  successful  integer  programs. 

The  second  experiment  in  Chapter  III,  tests  whether  it  is 
computationally  favorable  to  disaggregate  a  non-linear  function  into 
"easier"  components;  model  each  component  individually;  and  then  "link" 
the  models  together  by  termwise  addition  in  the  objective  function  before 
solving  the  final  MIP.  We  test  this  aspect  of  modelling  linkage  on  a 
piecewise-linear  function  with  fixed-charges.  The  results  favor  the 
modelling  of  the  entire  function,  even  when  the  "linked"  modelling  parts 
are  modelled  using  our  sharp  modelling. 

Besides  testing  modelling  linkage,  this  experiment  also  provides 
evidence  favoring  "hereditarily"  sharp  models.  Unless  some  variable  co¬ 
ordination  is  performed  upon  the  linked  models,  they  rarely  will  be 
hereditarily  sharp.  Thus,  once  the  Branch-and-Bound  algorithm  begins  to 
arbitrate  variables,  the  modelling  loses  it  sharpness,  resulting  in  many 
more  branch-and-bound  nodes. 

In  Chapter  IV  we  provide  computational  support  of  a  common 
conjecture  that  the  proximity  of  the  convex  hull  of  the  integer  feasible 
solutions  to  the  set  of  integer  feasible  solutions  is  a  guide  towards 
problem  difficulty.  For  this  experiment,  we  use  the  Fixed-Charge,  and 
Fixed-Benefit  problems  mentioned  earlier.  The  Fixed-Charge  problem  is  a 
very  well-known  difficult  mixed-integer  problem.  On  the  other  hand,  it 
turns  out  that  the  Fixed-Benefit  problem  is  relatively  easy  to  solve  as  a 
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MIP.  The  Fixed-Charge  problem  usually  has  a  large  gap  between  the  two 
sets  mentioned  above,  while  the  Fixed-Benefit  problem  with  low  usage 
levels  (as  common  in  practice)  has  a  much  smaller,  nearly 
indistinguishable  difference  between  the  set  of  integer  feasible  points 
and  its  corresponding  convex  hull.  Our  computational  results  support  the 
stated  conjecture,  as  the  Fixed-Charge  problem  is  much  more  difficult 
than  the  Fixed-Benefit  problem. 

In  Chapter  V,  we  include  two  experiments  involving  the  modelling 
of  propositional  logic  problems  as  mixed-integer  programs.  We  find  these 
problems  extremely  easy  to  solve  using  elementary  modelling  techniques 
that  have  been  standard  since  the  1960*8.  Also,  we  can  show  an  advantage 
of  our  modelling  techniques  over  the  standard  ones. 

In  summary,  we  have  not  only  developed  theoretical  conditions 
concerning  the  existence  of  MIP  representations,  but  have,  in  doing  so, 
developed  an  automatic  modelling  for  every  bounded-MlP  representable 
problem  instance.  The  modelling  developed  is  always  sharp,  a  property 
which  we,  and  others,  have  found  to  greatly  improve  computational 
performance  in  a  Branch-and-Bound  mixed-integer  programming  system.  We 
provide  several  experiments  to  test  various  concepts  and  properties  of 
our  automatic  modellings,  and  MIP  modellings  in  general.  We  find 
Sharpness  and  hereditary  sharpness  as  essential  properties  for  successful 
model  building.  The  fact  that  our  modelling  is  completely  automatic,  and 
always  sharp,  greatly  enhances  its  use  as  a  modelling  technique. 
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CHAPTER  II 


MODELLING  WITH  INTEGER  VARIABLES 

Introduction 

The  interest  in  modelling  practical  and  mathematical  problems  as 
integer  and  mixed*integer  programs  begins  with  a  paper  by  Dantzig  lA]. 
Since  that  paper,  many  "standard  problem  formulations"  are  routinely 
available  in  elementary  textbooks.  However,  not  until  the  more  recent 
work  of  Meyer  <19],  [lOl  ,  Ill)  has  there  been  thorough  exploration,  or, 
perhaps,  even  awareness,  of  the  "limits"  of  integer  modelling.  Our  aim 

in  this  chapter  is  to  elucidate  some  of  the  subtleties  of  integer 
modelling. 

For  example,  the  fixed>charged  problem  can  be  modelled  without  a 
minimum  usage  constraint.  On  the  other  hand,  the  "fixed-benefit"  problem 
must  contain  a  minimum  usage  stipulation  (which  must  be  met  before  the 
benefit  is  received)  before  it  can  be  modelled  as  an  integer  problem. 

Our  results  explain  why  this  is  so. 

In  section  1  of  this  chapter,  we  find  that,  for  bounded  sets,  or 
functions  defined  on  bounded  domains  (Lemma  2.1.2),  the  subtleties  are 
neither  very  complex,  nor  restrictive.  However,  for  a  few  unbounded  sets 
the  restrictions  are  so  severe  that,  unless  the  unboundedness  can  be 
removed,  one  would  naturally  seek  a  different  mathematical  formulation  of 
the  sets.  For  example,  many  unbounded  functions  can  be  modelled  as 


lA 


generalized  linear  complementarity  problems,  which  have  no  integer 
modelling  (the  set  (x.y)  >  0  j  x*y  ■  O)  is  one  such;  see  Theorem  2.1.7). 

Even  with  the  .unboundedness  removed,  it  is  important  to  know  that 
a  bound  must  be  used.  This  kind  of  information  occurred  in  Meyer’s 
earliest  studies  of  integer  modelling  [9]  in  connection  with  the  fixed- 
charge  problem  (which  is  representable  only  in  its  bounded  form,  when 
rational  data  are  stipulated).  This  choice  of  a  suitable  bound  can  be  a 
major  issue  in  algorithmic  implementation. 

In  section  1,  we  develop  basic  definitions  and  many  corollaries 
which  lead  to  the  heart  of  our  investigations  into  set  representabiliCy . 
Theorem  2.1.7  describes  necessary  and  sufficient  conditions  for  the  union 
of  representable  sets  to  be  MIP-representable.  In  section  2,  we  explore 
Meyer’s  concept  of  bounded  representability,  and  develop  necessary  and 
sufficient  conditions  for  bounded  MIP-representability  (Theorem  2.2.1). 
For  bounded  representable  sets,  our  results  guarantee  one  representation 
in  which  all  integer  variables  are  binary  variables;  furthermore,  these 
binary  variables  occur  in  a  common  set-partitioning  constraint. 

We  then  focus  upon  the  implementation  of  MlP-representations ,  and 
ask  how  "accurate”  the  representations  are  after  the  integer  variables 
are  "relaxed"  to  continuous  (rational)  variables,  as  in  branch-and-bound 
algorithms  (section  3).  We  easily  show  that  the  best  possible  relaxed 
MIP-representation  contains  the  convex  hull  of  the  original  set.  When 
the  specific  MIP-representation  is  exactly  the  convex  hull,  it  is  called 
a  "sharp"  representation.  We  provide  sharp  representations  for  many 
models;  most  representations  in  common  use  are  sharp. 
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In  our  last  section  of  this  chapter  (section  4),  we  show  that 
Meyer's  bounded-MIP  representabi' ity  Theorems  [9]  are  specific  cases  of 
our  Theorem  2.1.7  when  all  variables  are  restricted  to  the  rational 
field.  We  then  apply  a  linear  inequality  MIP-representation  for 
polyhedra  developed  in  section  1  to  an  example  representation  used  by 
Beale  [3],  and  show  that  the  two  can  be  used  interchangeably  and  chat 
both  are  "sharp"  representations.  Moreover,  it  typically  occurs,  when  we 
turn  our  representability  techniques  to  problems  treated  earlier,  that  we 
obtain  the  most  efficient  (fewest  constraint)  linear  relaxation,  even 
while  our  relaxation  is  best  possible  (i.e.  "sharp").  To  do  so  may 
require  noting  some  algebraic  simplifications  for  the  specific  problem, 
but  that  is  all.  Our  problem  formulations  are  either  derived  from  or 

motivated  by  disjunctive  programming  fl],  (7]. 

While  Meyer's  theory  permits  representations  which  involve 
irrational  data,  we  have  restricted  ourselves  to  rational  representations 
only.  This  appears  to  cover  all  that  is  of  practical  interest,  and  so 
our  results  are  stated  for  the  rational  field  only.  In  some  cases,  the 
results  extend  to  the  real  field,  but  such  issues  can  be  addressed  later 
if  they  seem  of  interest. 

Throughout  this  thesis,  we  denote  the  convex  span  respectively  the 
closure  of  set  S  as  conv(S)  respectively  cl(S),  and  clconv(S)  is  the 
closure  of  conv(S)  [12].  Similarily  cone  (S)  denotes  the  convex  cone 
generated  by  S.  The  convex  span  and  cone  operations  are  taken  only  with 
rational  multipliers. 
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We  assume  that  the  reader  is  familiar  with  the  intended  use  of 
MIP-representability  in  problem  formulations,  as  illustrated  in  Meyer's 
papers. 

Section  1:  Representability  of  the  Finite  Union  of  Representable  Sets 
We  define  a  set  S  &Q  (where  Q  denotes  the  rational s)  as  MIP** 
representable  if  there  are  rational  matrices  Aj ,  A2,  A^,  and  vector  b 
with  the  property  that:  x  c  S,  if  and  only  if,  for  some  u,  v  >0,  with 
u  integer,  we  have 


^1*  *  ^2“  *  ^3'^  * 


(2.1.1) 


We  note  that  our  definition  extends  to  the  case  where  x  is  constrained  to 
be  integer.  I.e.,  let  x  ■  (x^  ,...,  x^)  and  let  I  be  any  subset  of 

{1  ,...,  n};  then  if  S  is  MIP-representable  so  is  the  set  {x  e  S(x^  e  Z 

(integers)  for  all  i  e  1}  (this  is  an  easy  exercise).  An  extension  of 
our  definition  of  MIP-representation  is  to  functions;  f  is  MIP- 
representable  if  the  set  epi(f)  is;  where  epi(f)  ■  {(z,x)|  z  >  f(x)  and 
f(x)  is  defined)  denotes  the  epigraph  of  f.  We  also  note  that  f  has  a 

rational  MIMM  on  T  in  Meyer's  sense  [10]  and  only  if  f  is  MIP- 

representable  in  the  above  sense. 

Our  first  theorem  is  an  extension  of  the  finite  basis  theorem  for 
polyhedra  (Rockafellar  [12]). 

Theorem  2.1.1 

Suppose  that  S  ^  0  is  MIP  representable.  There  are  constants  a. 
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b  >  0  such  that  for  all  integers  p  > 


S  .  e  0  and 
a+bp 


S 


g 

a+bp 


+  pM, 


(2.1.2) 


where 


n 

»  {x  £  S  j  I  I  I  <  oj,  and 


(2.1.3) 


M  »  {x  E  Z 


n 


for  some  u,  v  >  0,  with  u,  v  integer, 
AjX  +  A^u  ♦  A^v  ■  O). 


(2.1.4) 


Proof 


5 

a+bp 
X  ♦  px 


The  containment  S  +  pM  is  immediate  once  we  show  that 

*  0.  In  fact,  if  x  e  S  and  x'  c  M  then  by  a  direct  computation 
'  c  S. 

We  note  that  the  nonempty  set  P,  defined  as 


» 


P  •  {(x,u,v)j  AjX  +  A^u  +  A^v  ■  b;  u,  V  >  0}  (2.1.5) 

is  a  polyhedron.  Therefore,  from  the  finite  basis  theorem,  there  are 

disjoint  finite  sets  J  and  K  and  points  (x  ,  u^,  v^)  of  P  for  t  e  K  with 

P  “  conv(  {(x\  u\  v'^)  I  j  e  J))  ♦  (2.1.6) 

cone(  {(x*^,  v^)  ]  k  e  K)). 
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j  It  o  o 

We  define  a'*  ■  max  lx  I,  and  b  ■  Z  lx  I,  and  a  *  max  {a  ".lx  1}  for  x  c  S. 

kcK 

(Note,  Ixl  -  Ixjl+lx^l  +...♦1x^1  if  X  -  (xj,  x^  .  x^)).  Since  b  >  0 

o 

for  p  >  1  we  have  x  e  Thus  S^_^jjp  *  0  and  we  now  must  establish 

the  containment  S  c  S  pM. 

a-fbp 

k  k  k 

We  may  assume  in  what  follows,  that  (x  ,  u  ,  v  )  is  an  integer 
vector  for  k  e  K  (recall  that  A^,  A^,  A^,  b  are  rational).  In 
•  ,  ^ 

particular,  x  c  M  for  k  e  K. 

Lat  X  as.  Then  for  some  u,  v  ^0  with  u  integer,  we  have 
(x,u,v)  c  P.  Therefore  there  is  a  solution  to 


(x,u,v)  ■  Z  A.(x-^,  u-’,  v-’)  ♦  Z  X.  p(x'‘,  u*^,  v*^),  (2.1.7) 

jeJ  ■'  keK 

!■  I  X.  -,  X,>0,  jeJ,  T  >0,  kcK. 
jeJ  J  J  ^ 


Let  q  denote  the  integer  part  of  t  and  let  f  be  the  fractional  part, 
so  that  \  “  qj^  ♦  q^.  >  0.  Define 


(x',  u,  v')  »  Z  X-fx'^,  u'^,  v'^)  +  Z  f,  pCx'^,  u'^,  v'^)  (2.1.8a) 

jcJ  kcK 


*  *  *  k  k  k 

(x  ,  u  ,  V  )  -  Z  0  p(x  ,  u  ,  V  ).  From  (2.1.8a),  (2.1.8b) 

kcK 


* 

so  we  have  x  *  x"  ♦  x  . 
Since  (x,u,v)  • 


We  must  show  that  x' 


* 

^  ®a4bp  * 


(x",  u',  v")  ♦  (x  , 


it  * 

u  ,  V  )  and  u  and 


c  pM. 

* 

u  are 


integer  vectors,  u'  is  an  integer  vector.  From  (2.1.8a),  (x',  u'.v-*) 
P.  Hence  x.'  e  S.  Also  trom  (2.1.5),  and  (2.1.8a), 
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k  k 

Ix'l  <  a'  ♦  p  Z  fj^lx  I  <  a'  ♦pb.  Thus  x'  e  Since  x  e  M  and 

* 

is  integer  for  k  e  K,  we  have  x  e  pM  from  (1.8b). 

Q.E.D. 

We  note  that  2.1.1  defines  any  representable  set  S  in  terms  of  a 
bounded  part  (S  .  )  and  a  discrete  unbounded  part  (pM)  as  in  the  finite 
basis  theorem  for  polyhedra.  For  ®  ^  Q,  is  MIP-representable;  hence 
Sa^bp  representable.  In  addition,  M  is  an  integral  monoid  with  a 
finite  basis  (see  Jeroslow  (8]). 

Corollarv  2.1.2 


If  S  is  both  bounded  and  MIP-representable,  S  is  a  finite  union  of 


polytopes. 


Proof 


It  suffices  to  show  that  a  bound  can  be  placed  on  the  vector  u 


occuring  in  the  representation. 

Assume  that  x  c  s.  From  the  proof  of  Theorem  2.1.1,  and  since  S 

k 

has  no  recession  directions,  we  have  x  »  0  for  k  e  K.  Hence  in 
(2.1.8a),  x'  •  X  and  A^x  ♦  A2U'’  A^v'  *  0  with  u",  v"  >0  and  u' 

integer,  (recall  x  t  S)  A  bound  on  lu'l  is  max  lu"*  I  *  I  lu  I  directly 

j  cj  kcK 


from  (2.1.8). 


Q.E.D. 
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Corollary  2»1 »3 

If  S  is  representable,  conv(S)  is  a  polyhedron. 

Proof 

Apply  (2.1.2)  for  p*l.  Since  M  is  a  monoid  we  have 

conv(S  +  M)  «  conv(S  ,)  +  cone(M). 
a't'b  a+b 

To  establish  (2.1.9),  note  that  clearly 

conv(s^^jj  +  M)  c.  conv(S^^j^)  ♦  cone(M)  ( 

is  valid.  To  prove  that 

conv(S^^j^  +  M)  5  conv(S^^j^)  ♦  cone(M),  ( 

we  have,  for  any  v  e  conv(S  )  +  cone(M), 

a+b 

V  *  rX-s"^  ♦  Z  u.m.  ,  (, 

j  J  k 


where  I  A j  ■  1 ,  A^  >  0,  >  0,  e  M,  and  s^  c 


a+b 


let  A^  >  0  (WLOG),  then  we  have 


m  ■  Z  c  cone(M),  and  also 


(2 


(2.1.9) 


2.1.10) 


2.1.11) 


.1.12) 


.1.13) 
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V  -  A,(8'  ♦  m)  +  Z  X.(8^  ♦  0).  (2.1.14) 

^  3*1  J 

But  since  (s'  +  m)  e  S  .  *  cone(M),  then  v  c  conv(S  .  ♦  cone(M)). 

a+b  *  a+b 

Furthermore,  if  w  ■  s  ♦  w',  where  w'  e  cone(M)  and  s  c  then  for 

some  integer  D  >  1,  Dw'  e  M,  and  by  factoring, 

8  +  w'  -  1/D(8  ♦  Dw')  ♦  l/D(s+0)  +...+  l/D(s+0)  (2.1.15) 

where  (s  ♦  Dw')  e  S  ^  ♦  M  and  (s  +0)  e  S  ^  ♦  M.  Therefore 

a+b  a+b 

(s  ♦  w")  e  conv(S  ,  +  M),  and 
a+b 

S  .  ♦  cone(M)  cconv(S  .  +  M).  (2.1.16) 

a+b  —  a+b 

Finally, 

conv(S  •)  +  conelM)  cccpv(S  ,  +  cone(M))  Cconv(S  .  ♦  M).  (2.1.17) 

a+b  a+b  a+b 

which  proves  (2.1.9) 

By  Corollary  2.1.2,  conv(S^^^)  is  a  polytope.  Since  M  has  a 
finite  basis,  cone(M)  is  a  polyhedral  cone.  By  the  converse  to  the 
Finite  Basis  Theorem  (Rockafellar  [11]),  conv(S)  is  a  polyhedron. 


Q.E.D. 
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Corollary  2.1.4 

S  is  an  MIP-representable  set  if  and  only  if  there  is  a  bounded 
MIP-representable  set  S'  and  an  integral  monoid  M  with  a  finite  basis 
such  that  S  *  s'  ♦  M. 

Moreover,  if  S'  is  an  MIP-representable  set  (bounded  or  not)  and  M 
is  an  integral  monoid  with  finite  basis,  then  S  *  S'  M  is  MIP- 
representable. 


Proof 


The  "only  if"  is  established  by  Theorem  2.1.1. 

If  S  *  S'  M,  let  Aj,  A^,  Aj,  and  b  be  rational  matrices  and 


vector  such  that 


if  X  e  S'  for  some  u,  v  >  0,  u  integer, 


(2.1.18) 


AjX  ♦  A^u  ♦  AjV  -  b, 


and  let  A^  be  an  integral  matrix  such  that 


m  e  M  ••  for  some  w  >  0  integer 


(2.1.19) 


m  ■  A,  w. 
4 


Then  we  have, 


X  c  S  -  there  are  x^  and  x^,  u,  v,  w  >  0  with  u,  w  integer 


that  X  ■  Xj  ♦  x^,  AjXj  ♦  A2U  AjV  ■  b,  and 


V  •  *2* 


(2.1.20) 
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vhich  i«  an  MIP-reprcsentstion  of  S. 

q.e.d. 

The  next  result  is  a  technical  corollary  that  produces  a  necessary 
condition  for  HIP-representability  using  recession  directions. 

Corollary  2.1.5 

*  n 

Suppose  S  IS  MlP-representable  and  x  c  Q  is  a  vector  for  which 
0 

there  exists  some  x  e  S  and  a  sequence  \  '*''*'  **  vith 

o  * 

X  T|^x  e  S  for  all  k«l,2,  ...  (2.1.21) 

Then  there  is  an  o  >  0,  o  e  Q,  with  the  property  that  for  all 
X  e  S  and  all  integers  p  >  0, 

* 

X  ♦  pox  e  S,  and  (2.1.22) 

ox  c  z"( integers) . 

A 

Rmk:  The  conditions  (2.1.21)  are  sometimes  abbreviated  by  saying  that  x 

o 

IS  a  local  recession  direction  of  S  at  x  . 

Proof 

k 

By  Theorem  2.1.1,  for  each  k«l,2,...,  there  exists  x  c  and 

k  o  *  k  k  k 

m  e  cone(M)  with  x  ♦  Tj^x  ■  x  Tj^m  ,  where  Xj^m  e  M  (so  that 

k 

m  e  cone(M)).  Then  note 


J’.I  J||>  '.'•  'A  '•'' 


na 
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I 


>•. 


/-•  V 


*  k  k  o 
lx  -  n  I  <  lx  -  X  I/t.  <  B/t.  , 


(2.1.23) 


vhere  B  is  a  bound  independent  of  k. 

* 

Since  we  have  x  e  clcone(M)  ■  cone(M),  as  cone(M)  is 

* 

finitely  generated  [8].  Hence,  for  some  o  >  0,  we  have  ok  e  M,  which 

*  n  .  * 

implies  ox  e  Z  .  Finally,  from  Theorem  2.1.1,  x  poc  e  S  whenever  x  e  S 

and  p  >  0  is  integer. 

Q.E.D. 

Example  2.1.1 

Let  S  ^  Q  be  an  MIP-representable  set  defined  by  S  »  tx  £  Z  |  x  ■ 

0  or  X  >  2}.  An  MlP-representation  of  S  is 


x  e  S  there  exist  u^,  U2  >  0,  u^,  U2  integer  such  that  (2.1.24) 


X  -  2u^  -  3u2 


0. 


*  0 

with  X  ~  1,  X  *  2,  the  hypotheses  of  Corollary  2.1.5  are  met  by  letting 

•  k.  The  conclusion  also  holds  if  0*2.  It  is  important  to  note 

e 

that  we  cannot  satisfy  the  corollary  merely  by  making  ox  an  integer. 

It 

E.G.,  0*1  does  not  satisfy  x  ♦  ox  p  e  S  for  p»l  and  x*0.  Thus  even 
integer  local  recession  directions  need  not  be  global  recession 
directions  (i.e.  need  not  satisfy  (2.1.22)). 

Suppose  we  are  given  representable  sets  .  S^.  Trivially, 

their  intersection  is  representable,  but  what  conditions  must  hold  for 
their  union  (S^  U  U  •••  U  S^)  to  be  representable?  The  next  fixed- 


charge  example  illustrates  some  complexities  of  this  question. 
Example  2.1.2 


25 


Let  f  be  the  fixed-charge  function 


f(x)  ■  j  1  ,  X  >  0  (2.1.25) 

^0  ,  X  •  0. 

Note  that  epi(f)  *  U  ^2  '*^**^*  polyhedra  (thus 

representable)  and  defined  by 


Sj  ■  {(2,x)|z  >  0,  X  -  0}  (2.1.26) 

$2  ■  {(*,x)|*  >  1,  X  >  0). 

e  * 

However,  $2  (J  Sj  is  not  representable.  In  fact,  with  (t  ,  x  )  »  (0,1) 

0  0 

and  (s  ,  X  )  ■  (1,0)  the  hypotheses  (2.1.21)  of  Corollary  2.1,5  hold  for 
S2.  But  the  conclusion  (2.1.22)  fails  since  (0,0)  e  epi(f)  and 
(0,0)  ♦  y(0,1)  ■  (0,y)  /t  epi(f)  for  any  Y  >  0. 

In  the  above,  a  recession  direction  of  S2  (namely  (0,1))  fails  to 
be  a  global  recession  direction  of  S2.  However,  this  is  not  the  relevant 
feature  of  our,  example,  for  the  precise  relative  placement  of  Sj  with 
respect  to  $2  (and  not  merely  recession  directions)  also  makes  a 
difference.  E.G.,  if  S'  ■  {(-*,1)  |  z  >  l)  then  (0,1)  is  not  a  recession 
direction  of  Sj  either,  but  Sj  (J  $2  is  MlP-representable  (exercise). 
Because  of  the  relative  placement  of  $2,  and  $2,  we  see  that  (0,1)  is 


.‘.v 


.  .'TV-. 


V'  .vf  •.  I.'  a.*.  O.  m-’  . 
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not  •  global  racatsion  diraction  of  S  ■  (hanca  S  it  not 

raprasantable) ,  yat  (0.1)  ^  a  recession  direction  of  S'  “  Sj  U  82.  This 
turns  out  to  be  the  key  feature  for  MIP-representability  (see  Theorem 
2.1.7  below). 

The  simple  construction  of  the  linear  system  (2.1.27)  in  our  next 
result  will  play  an  important  role  in  our  results  to  follow.  This 
construction  derives  from  disjunctive  methods  [1],  [7]. 

Lemma  2.1.6 

If  S^  »...»  are  bounded  MlP-representable  sets,  then  their 
union  is  MlP-representable. 

Proof 

By  Corollary  2.1.2,  it  suffices  to  show  that  any  union  of 
polytopes  is  MlP-representable,  since  each  is  a  union  of  polytopes. 

Let  P^  ■  {x|a*^x  >  b  }  be  non-empty  polytopes  for  i*l,...s.  Note 
that  the  boundedness  property  implies  that  Ax>0-x*0.  A 
representation  of  P^  y  ...  IJ  given  by 

s  . 

X  -  Z  x^;  A^x*"  -  b^  >  0,  >  0  for  i-l,...,s  (2.1.27) 

8 

Z  X.  ■  1  and  X.  integer  for  i~l,...,s. 
i-1  ‘  ‘ 

Q.E.D. 

Before  presenting  our  main  theorem,  we  must  formally  define  our 
use  of  recession  directions.  Our  definition  is  restricted  to  discrete 
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recession  vectors  and  from  this  point  on,  any  mention  of  recession 
directions  implies  discrete  vectors.  (Note  this  restriction  has  no 

effect  upon  corollary  2.1.5). 

* 

An  integer  vector  x  is  a  discrete  recession  direction  of  a  set  S 

* 

if  for  all  X  e  S  and  integers  p  >  0,  we  have  x  ♦  px  e  S.  The  set  of  all 
recession  vectors  is  denoted  rec(S). 

Note  that  rec(S)  is  dosed  under  addition,  and  integer  multiples, 
and  0  c  recCS);  thus  rec(S)  is  an  integer  monoid.  [8] 

Lemma  2.1.6  is  still  valid  if  each  polyhedron  gives  the  very 
same  continuous  recession  directions  {x|a  x  >  0),  independent  of  i.  In 
this  case,  the  ^  0  need  not  be  bounded.  The  same  proof  of  lemma 
2.1.6  justifies  this  claim.  In  particular,  if  each  function  f^  is 
polyhedral  on  a  nonempty  domain  (dom(fj^)  ^  0)  and  the  recession 
directions  of  all  sets  domCf^)  are  the  same  for  i  e  I(i),  then  the 
function 


f{x)  »  min{f^(x)|x  c  dom(f^)) 


(2.1.28) 


is  MIP-representable. 

We  next  state  our  main  result  on  when  the  finite  union 
representable  sets  is  representable. 

Theorem  2.17 

Suppose  that  ,...,  are  MIP  representable  non-empty  sets. 
Then  S  ■  Sj  (J  . . .  IJS^  is  MIP-representable  if  and  only  if  every 
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discrete  recession  direction  of  every  has  a  positive  multiple  which  is 
a  discrete  recession  direction  of  S. 

Proof 

* 

A  recession  direction  x  of  ^  0  satisfies  the  hypothesis  of 

Corollary  2.1.5.  The  conclusion  of  Corollary  2.1.5  must  hold  if  S  is 

MIP-representable,  hence  our  conclusion  is  necessary. 

Next  suppose  our  condition  holds;  we  must  show  that  the 

corresponding  S  is  MIP-representable.  Let  the  sets  and  M*”  and 

i  i*^ 

constants  a-,  b-  >  0  be  defined  for  S.  as  in  Theorem  2.1.1.  From  (2.1.4) 
i’  1  1 

each  integer  monoid  is  finitely  generated  and  therefore  has  a  set 

of  generators  a^^j  e  R  for  j  e  l(i),  where  I(i)  is  an  index  set.  Each 

a^j  is  a  discrete  recession  direction  for  S^,  and,  from  our  condition, 

o.,a..  c  rec(S)  for  some  a..  >0.  Since  a...  c  Q,  we  may  take  o .  .  c  2  by 

ij  xj  xj  xj  '<»  /  ij 

clearing  denominators.  Then  o.  ■  II  o.  .  is  an  integer  and 

"  jcKi) 

o.M.  Crec(S).  Furthermore,  from  Theorem  2.1.1  we  have 

X  X 


^a.+b.o.  ®i**i 

X  XX 


i“l . t 


(2.1.29) 


In  fact,  we  have 


S 


s  )  ♦ 

a  .  ♦b  .  0, 
XXX 


t 

(  I 
i*l 


(2.1.30) 
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The  containment  (c)  in  (2.1.30)  is  trivial,  since  if  x  c  S  ve  have  x  e 
for  some  i*l . .  and  (2.1.29)  holds.  As  to  the  contrinment  (p)  in 

(2.1.30),  let  X  ■  X**  ♦  X  ,  with  x°  c  U  .  and  x  e  E  o  M.. 

’  ’  .  ,  a.+b.o.  .,11 

1*1  111  1*1 

Since  each  fc  S.  ,  we  have  x°  c  S.  Since  o.M.  C  rec(S),  we  have 

a.*b.o.  1*  11' 

* 

X  e  rec(S).  Thus  x  e  S. 

Each  set  S*'  .  is  a  bounded  set,  and  MIP-representable;  thus  by 

a  .♦t*  .0. 

1  11 

Lemma  2.1.6,  the  union  is  representable.  Since  each  is  finitely 
t 

generated,  so  is  I  o.M..  By  applying  the  necessary  condition  of 
i-1  ^  ‘ 

corollary  2.1. A  to  (2.1.30),  S  is  MIP-representable. 

Q.E.D. 

From  Theorem  2.1.7,  we  may  determine  representability  of  any 
finite  collection  of  representable  sets  using  discrete  recession 


directions.  Moreover,  from  the  proof  of  Theorem  2.1.7,  it  is  not 

necessary  that  every  discrete  recession  direction  of  have  a  positive 

multiple  that  is  in  r€c(S);  we  only  need  o^M^  C  rec(S)  for  a  suitable 

integer  >  1,  where  M^  is  the  integer  monoid  of  (2.1.4).  One  easily 

proves  that  M^  &  rec(S£),  but  the  converse  is  false,  as  our  next  example 

shows.  This  example  also  illustrates  the  fact  that  M^  depends  not  only 

on  S  ,  but  also  on  a  specific  representation  for  S.. 
i 


Example  2.1.3 


A  representation  of  the  set  S  ■  {x[x  >0}  is 
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X  ■  X,  ♦  2x_,  0  <  X.  <  2,  X  >  0,  X.  e  Z.  (2.1.31) 

1  2  12  2 

Using  (2.1.4)  to  determine  M,  we  find  that 

M  ■  {x  •  2x^1 >  0,  £  Z),  (2.1.32) 

As  x^  ■  0  is  forced  in  M.  Ue  know  that  a  recession  direction  of  S  is 

*  * 

x  “1,  but  in  this  representation  x  ■  1  ^  M.  (However,  whenever  our 

MIP-representation  results  in  a  x  c  rec(S),  but  x  ^  M,  we  will  always 

■k 

have  some  o  >  0,  with  ox  e  M.)  If  '2'  is  replaced  by  *1'  as  a 
coefficient  in  (2.1.31),  the  resulting  M  is  indeed  all  of  the  recession 
directions.  An  alternate  form  of  Theorem  2.1.7  requires  the  cone  of  the 
recession  directions  of  the  set  S  to  be  equal  to  the  union  of  the  cones 
of  recession  directions  of  each  S^. 

Corollary  2.1.8 

Let  Sj  ,...,  be  MIP-representable  and  put  S  •  Sj U  . . .  (j  . 
Then  S  is  MIP-representable  if  and  only  if 

t  t 

cone(rec(S))  ■  U  cone(rec(S  • ) )  *  Z  cone(rec(S. ))  (2.1.33) 

i-1  ^  i-1  ‘ 

In  particular,  if  S  is  MIP-representable,  the  right-hand-side  of  (2.1.33) 
is  a  cone. 

Proof ; 

By  Theorem  2.1.7,  S  is  MIP-representable  if  and  only  if 
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rec(S^)  c  coneCrecTs) }  for  (2.1.34j 

(Recall  that  the  cone  operation  is  only  with  respect  to  rational 
multipliers) . 

*  o 

Suppose  S  IS  representable  and  x  t  rec(S).  Fix  any  x  e  S.  Then 

0  * 

there  is  some  such  that  x  ♦  qx  ^  for  infinitely  many  q  >  0.  By 

* 

Corollary  2.1.5,  we  also  have  ox  c  recCS^)  for  at  least  one  o  >  0  and 
i  e  {1  ,...,  t}.  Thus, 

t 

rec(S)c  U  cone  {rec(S  .  )  ).  (2.1.35) 

i-1  ^ 

Upon  taking  the  cone  generated  by  each  set  in  (2.1.34)  and  applying 
(2.1.35),  we  find 


t 

rec(S)  c  U  cone {rec(S .))  E  cone {rec(S) ).  (2.1.36) 

i-1  ^ 


We  now  show  that  C  •  cone {rec(S . ) }  is  a  cone.  This  will  also 

i‘l  ^ 


demonstrate  that  C  -  I  cone{rec(S. )}. 

i-1  ^ 

In  fact,  if  x,  y  e  C  then  for  certain  j,  and  k,  we  have 


X  e  cone {rec(S . ) )  and  y  c  cone{rec(S  )}.  By  (2.1.39),  x,  y  c 
J  k 

cone{rec(S) },  so  x  +  y  €  cone{rec(S) );  i.e.,  for  some  o  c  Q,  we  have 
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o(x+y)  e  rec(S).  By  the  first  containment  in  (2.1.36),  for  some  p  we 
have  o(x+y)  t  cone trec(Sp))  and  hence  x+y  t  cone  (rec(Sp) }.  As  C  is 
closed  under  addition  (and  clearly  closed  under  positive  multiples),  C  is 
a  cone. 

By  applying  the  cone  operation  rec(S)  in  (2.1.36)  we  obtain 
(2.1.33)  when  S  is  representable. 

Suppose  (2.1.33)  holds,  it  implies  cone {rec(S^) )  c  cone {rec(s) ) 
for  i*l,...,t,  in  which  case  (2.1.34)  follows,  forcing  S  to  be 
representable. 

Q.E.D. 

Note  that,  as  we  saw  in  the  fixed-charge  example  (Example  2.1.2), 
it  is  not  sufficient  for  the  r.h.s.  of  (2.1.33)  to  be  a  cone.  But  if  the 
COT etrecfS^)}  is  independent  of  i»l,...,t,  the  result  holds. 

Corollary  2.1.9 

If  conelrecfS^))  is  independent  of  i“l,...,t,  and  each  S.  is 
representable,  then  S  »  S^U  ...  IJ  is  representable. 

Proof: 

* 

Fix  1*1,. ..,t.  If  X  e  cone (recCS^) ),  then  for  all  j*l,...,t  we 

It 

have  X  c  cone( rec (S j ) ) .  Hence,  for  any  j  there  is  some  integer  >  0 

* 

for  which  we  have  o-x  c  rec(S.). 

J  J 

* 

Define  o  •  H  Oj.  Then  ox  t  rec(S;)  for  all  j*l,...,t.  Thus 
i*l  ^ 

* 

ox  c  rec(S),  and  the  necessary  condition  of  theorem  2.1.7  is  verified. 


Q.E.D. 
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We  note,  from  example  (2.1.2),  that  the  sufficiency  condition  of 
Corollary  2.1.9  is  not  necessary. 

Proposition  2.1.10 


Proof: 


If  S  is  MIP-representable,  then  S  is  closed. 


Suppose  x”  e  S  for  n*l,2,...  and  x"  ♦  x°.  With  o=  lx°  i  +  1,  S 


is  a  bounded  representable  set.  By  Corollary  2.1.2,  is  a  finite  union 

n  o 

oi  polytopes,  and  thus  closed.  Since  x  c  for  large  n,  x  e  S  c  s. 


Q.E.D. 


Example  2 . 1 .A 


The  "fixed-benefit  function"  f  given  by 


fo  •  -  ’  " 

1-1  .  X  >  0,  X  <  M 


for  either  M  finite  or  infinite,  is  not  MIP-representable,  since  epi(f) 
is  not  closed  and  Proposition  2.1.10  applies.  (Note  that 
(-1,  — )  e  epi(f)  for  each  n  but  (1,0)  ^  epi(f)).  (N.B.  In  the 
"minimizing  format"  we  have  assumed,  cost  is  minimized,  so  that  a  benefit 
or  profit  shows  up  as  a  negative  cost.  If  one  switches  to  a  maximizing 
format,  the  difficulties  will  remain). 

By  the  kind  of  reasoning  as  in  the  example,  one  easily  establishes 
the  known  result  that  a  representable  function  is  lower  semicont inuous . 


34 


Example  2 . 1 «5 

The  "fixed-benefit  function  with  minimum  usage  level"  6,  is  given 
by 


f(x) 


,  X  -  0; 

,  5  <  M. 


For  M  >  6  >  0  ,  and  M  finite,  f(x)  is  MlP-representable.  In  fact,  one 
easily  shows  that  epi(f)  is  the  union  of  two  polyhedra  which  have  (1,0) 
as  their  sole  recession  direction,  and  our  earlier  remarks  apply.  In 
fact,  by  Theorem  2.2.1  and  remarks  to  follow,  f  is  representable  using 
only  binary  integer  variables  in  a  set-partitioning  constraint. 

(Actually,  we  can  have  M  infinite  and  f  will  be  representable;  this  is  a 
somewhat  harder  exercise). 

We  conclude  this  section  with  a  necessary  and  sufficient  condition 
for  a  set  S  to  be  MIP-representable. 

Theorem  2.1.11 

A  set  S  is  MlP-representable  if  and  only  if  S  has  the  form 


S  *  (  U  P.)  ♦  M  (2.1.36) 

i*l  ^ 


for  a  finite  set  of  polytopes  Pj  ,...,  P^  and  an  integer  monoid  M  with 
finite  basis. 


Proof: 


Necessity  follows  from  Corollary  2.1.4  and  then  Corollary  2.12. 

t 

Sufficiency  follows  by  Corollary  2.1.4,  since  S'  •  UP-  is  MIP- 

i*=l  ^ 

representable  (recfP^)  •  {0}  as  each  P^  is  a  polytope,  and  e.g.  Corollary 
2.1.9  can  be  used). 

Q.E.D. 

Section  2:  Bounded-integer  Representability 

In  this  section  we  explore  Meyer's  concept  of  bounded 

representability.  We  define  S  as  bounded  MlP-representable  if  there  are 

0 

ra:ional  matrices  ,  A2,  A^  end  a  vector  b,  plus  a  vector  bound  u  such 
th.it 


0 

X  e  S  «•  there  are  u,  v  >  0,  u  integer  and  u  <  u  ,  and  (2.2.1) 

AfX  A^u  +  A^v  «  b. 

Note  that  S  need  not  necessarily  be  bounded  as  a  set,  in  order  to  be 
bounded  MIP-representable;  e.g.  we  can  have  S  ■  P^  U  P^  where  P^  and  P2 
are  unbounded  polyhedra  with  rec(Pj)  •  rec(P2). 

Theorem  2.2.1 

S  is  bounded  MIP-representable  if  and  only  if: 

(i)  S  is  a  finite  union  of  polyhedra;  and  also 

*  n 

(ii)  The  following  condition  is  satisfied  for  every  vector  x  e  Q  : 

o  0  * 

If  there  exists  x  e  S  and  ♦  “with  x  Tj^x  c  S  for  k“l,2.... 
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* 

then  for  all  T  >  0  (t  e  Q)  and  all  x  e  S,  we  have  x  4  tx  e  S. 

Remark : 

The  condition  in  (ii)  is,  in  words,  that  every  local  recession 
direction  of  S  is  also  a  continuous  global  recession  direction  of  S. 
Proof: 

First  suppose  S  is  bounded  MlP-representable .  Clearly,  S  is  a 

union  of  polyhedra. 

*  ^  .  o 

When  X  #  0  is  such  that  there  exists  an  x  e  S  and  \  *  with 

o  * 

X  TjjX  e  S  for  all  k“l,2,...,  then  there  exist  uj^,  Vj^  >0  with  uj^ 
integer  such  that 

/  0  o 

Aj(x  ♦  Tj^x  )  4  A2Uj^  4  AjVj^  *  b,  and  <  u  for  all  k.  (2.2.2) 

* 

Dividing  both  sides  of  (2.2.2)  by  I  (  Tj^x  ,  Uj^,  Vj^)  |  ♦  4  •  and  using 

* 

compactness  we  find  that  there  exists  v  >0  with 

AjX*  4  A3V*  -  0.  (2.2.3) 

*  * 

By  rationality  of  Aj ,  A^,  x  ,  we  may  assume  v  is  rational.  By  a  direct 

* 

computation,  x  4  xx  c  S  whenever  x  c  S  and  p  >  0. 

For  the  converse,  suppose  that  S  is  a  finite  union  of  nonempty 
polyhedra  S  ■  Pj  U  ...  U  the  condition  holds.  If  we  utilize  the 


notation  *  {x  |  A  x  >  then  (2.1.27)  provides  a  bounded 

representation  of  S. 

In  fact,  let  the  set  represented  by  (2.1.27)  be  denoted  S”". 

For  i>l,...,s  by  putting  ■  1  and  Xj  »  0  for  j  +  i,  x'^  *  0  for  j  ♦  i, 

we  see  that  c  s'.  Thus  S  ^  S'.  Next,  let  x  e  S',  so  that  (2.1.27) 

i 

holds  for  certain  x  and  X^.  For  some  k*l,...,s  we  have  Xj^  ■  1  and 

X^  “  0  for  i  +  k.  Without  loss  of  generality,  k*l.  If  i  ♦  1  and 

i  i  i  o  i  ^  0 

Ax  -  b  X^  >  0,  then  x  +  tx  ^  p  >  0  whenever  x  e  pr.  By 

the  hypothesis  (ii),  x  tx^  e  S  whenever  x  e  S  and  t  >  0.  We  have  x^  e 

12  12  3 

S;  hence  x  x  eS;  hence  x  -t-  x  •*-  x  e  S;  etc.  In  this  manner,  we 

establish  that  x  ■  Zx*"  e  S.  Hence  S'  ^S.  We  conclude  that  S  “  S',  as 

desired. 

Q.E.D. 

Theorem  2.2.1  excludes  non-trivial  integer  monoids,  as  well  as 
many  other  unbounded  representable  sets,  from  being  bounded  MIP- 
representable. 

The  proof  of  the  "sufficiency”  part  of  Theorem  2.2.1  reveals  that, 
when  S  is  bounded  MlP-representable,  it  has  at  least  one  representation 
(2.1.27)  in  which  all  the  integer  variables  ocurring  are  binary  variables 
that  appear  in  the  same  set-partitioning  constraint  (i.e.  ZX^  *  1). 
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Section  3;  Sharpness  of  HepTfesentations 

Another  property  of  MlP-representations  involves  the  "relaxation" 
of  the  nonnegative  integral  variables  u  of  (2.1.1)  to  nonnegative 
rational  values. 

Our  interest  in  this  relaxation  lies  in  the  fact  that,  in  branch- 
and-bound  algorithms,  this  relaxation  is  utilized  in  the  subproblems 
formed.  Obviously,  it  is  desirable  for  this  relaxation  to  be  "as 
accurate  as  possible."  We  next  prove  that  the  relaxed  representation 
contains  the  convex  hull  of  the  original  set  S.  This  fact  places  a 
substantial  limitation  on  the  accuracy  of  the  relaxation. 

Proposition  2.3.1 

Whenever  matrices  ,  A^,  A^,  and  a  vector  b  exist  such  that 


x  e  S  there  are  u,  v  >  0  with  u  integer  and 


(2.3.1) 


A)*  ♦  A^u  ♦  AjV  -  b. 


then  the  following  holds: 


X  e  conv(S)  ♦  there  are  u,  v  >  0  such  that 


(2.3.2) 


AfX  ♦  A2U  ♦  A-jV  ■  b. 


Proof: 


X  c  conv(S)  implies  there  exist  >  0  for  i*l,...,n  such  that 


X  ■  £  X.x.  where  Z  X.  •  1  and 


(2.3.3) 
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e  S  for  i«l , . . . ,n. 


By  taking  a  summation  of  the  following  valid  equalities. 


*1*1*1  *  *2*l“l  *  *3*1*1  ■  *1*'  •  “l 


A.Xx  ♦A^Xu  ♦A^Xv  *Xb  ,  u  integer 
Inn  2nn  3nn  n  n 


we  get. 


n  n  n 

A{  rX.x.)t-A(  IXu.)+A(  j:Xv)«b 
1  i-1  ^  I  2  i.^  i  I  3  i  i 


which  gives  (2.3.2). 


A  MIP-representation  of  S  (2.1.1)  in  which  (2.3.2)  is  bi- 


Q.E.D. 


conditional  (**)  is  called  "sharp."  A  sharp  representation  is  as  accurate 
as  possible  in  its  linear  relaxation.  By  Proposition  (2.1.1),  this  may 
not  be  very  accurate;  but  inappropriate  MIP  representations  can  be  even 
less  accurate. 

Example  2.3.1 


The  bounded  fixed-charge  function 
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f  0  ,  X  •  0 

(  1  ,  0  <  X  < 

W  -  ,  X  [0,1 


has  an  MIP  representation  of  epi(f)  as:  "r  >  _>x,  0  <  x  <1,  z  integer." 

2 

The  LP  relaxation  defines  the  function  _*x  on  [0,1],  and  the  epigraph  of 

2 

— 'x  is  larger  than  conv(epi{f )).  Of  course,  the  MIP  represntation 
2 

"z  >  X,  0  <  X  <  1,  z  integer"  is  sharp.  As  we  shall  see  below,  MIP 
representations  in  common  use  are  often  sharp. 

In  lemma  2.1.6,  we  proved  that  union  of  polytopes  is 
representable.  We  now  show  that  the  corresponding  representation 
depicted  by  (2.1.27)  is  sharp. 

Proposition  2.3.2 

If  all  the  non-empty  polyhedra  P^(i*l , . . . ,s)  have  the  same 
directions  of  recession,  then  (2.1.27)  is  a  sharp  representation  of 

Pj  U  ••• 


Proof : 


We  omit  the  proof  that  (2.1.27)  is  a  representation  of 


P|  This  proof  is  easily  accomplished  by,  e.g.,  using  our 

argument  below  for  the  sharpness  of  the  representation,  and  then  noting 
that  (when  the  X  are  all  integer)  we  have  X.  ■  1  below. 

To  establish  sharpness,  we  need  only  show  one  direction  of  the  bi¬ 
conditional,  and  cite  Proposition  2.3.1.  (To  establish  a  representation, 
we  need  only  show  the  same  direction  of  the  bi-conditional  in  (2.1.1),  as 
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the  converse  direction  is  trivial  for  (2.1.27).  I.e.,  for  the  set  S 

defined  in  (2.1.27),  one  trivially  has  S  p  Pj(j  ...  JJ^g). 

In  (2.1.27)  relax  integer"  to  c  Q"  for  i*l,...,s. 

Then  if  A.  >  0,  we  have  x./A.  •  w.  e  P.» 

1  1111 

We  find  that  x  *  £  A.w^  ♦  £  w^,  where  w^  e  P.  if  A.  >0  and  w^ 

Ai>0  ^  A^-0  ‘  ^ 

is  a  recession  direction  of  P.  if  A.  ■  0. 

1  1 

Without  loss  of  generality,  Aj  >  0  (as  £Aj^  ■  1).  Tlien  A^w^  ♦ 

i  1  i  i  . 

£  w  «  Ai  (w  £  w  /A]).  Since  each  w  is  a  recession  direction  of 

A.*0  A.«0 

1  1 

1  i 

P]  also,  w*  *  w  ♦  £  w  /At  c  P..  Thus,  without  loss  of  generality,  we 

A.«0 

1 

can  assume  x  *  £  A-w^.  Hence  x  e  conv(P.  U  •••U 

A^>0 

Q.E .D. 

If  the  MIP-representation  of  S'  is  sharp  in  Corollary  2.1.4,  the 


construction  gives  a  sharp  representation  of  S.  Thus  sharp 
representations  always  exist. 

Note  that  as  integer  variables  are  arbitrated  (set  to  zero  or  to 

one)  in  a  branch-and-bound  algorithm,  the  representation  (2.1.27)  of  the 

union  of  bounded  sets  S^y  ...(J  S^,  as  obtained  by  following  the  idea  of 

the  previous  paragraph,  does  not  correspond  usually  to  subunions. 

The  juxtaposition  of  sharp  representations  for  two  set  and  S^, 

while  a  representation  for  S  fj  ^  ,  is  typically  not  sharp.  E.g.  if  S 

1  2 
n 

is  a  polyhedron  and  $2  *  Z  is  the  n-dimensional  integers,  the  linear 


A2 


relaxation  of  the  juxtaposed  representations  gives  the  polyhedron 
(which  is  typically  not  the  convex  span  of  the  integer  points  in  S,). 
Thus  the  probletn,  of  obtaining  sharp  representations  of  intersections  of 
representable  sets,  subsumes  cutting-plane  theory  for  linear  integer 
constraints,  and  is  therefore  not  expected  to  have  an  easy  solution 
except  for  special  cases. 

It  is  sometimes  convenient  to  state  the  "polyhedral"  disjunctive 
representation  (2.1,27)  in  an  alternative  form,  called  the  "extreme 
point"  form. 


Let  ■  conv{x  [  j  e  I(i)}  +  C,  where  the  index  set  l(i) 

k 

depends  on  i»l,...,s  and  the  polyhdedral  cone  C  *  cone  {v  j  k  e  K}  is 

independent  of  i.  The  "extreme  point"  representation  of  P.U  , . .U  P 

i  s 


X  ■  L  Z  X.  .x*’-^  ♦  Z  o  V*' 
i-l  jel(i)  kcK  ^ 


(2.3.4) 


A.  -  Z  X.  , 
'  jel(i) 


1  ■  Z  X. 


X^  integer 


>  0  >0 
1  xj  k 
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Proposition  2.3.3 

If  all  the  non-empty  polyhedra  P^(i“] , . . . ,s)  have  the  same 
directions  of  recession,  then  (2.3.4)  is  a  sharp  representation  of 

Pj  U  •••  U  P^. 


Proof : 


Let  S"  be  the  set  defined  by  (2.3.4)  in  variables  x  c  R  ,  and  let 


S  *  Pj  U  •••  U  P3. 

If  X  e  P . ,  let  A.  . ,  0,  >  0  be  such  that  Z  X.  ,  *  1  and 
‘  ‘-I  "  jEl(i) 

ij  k 

X  *  I  X;  ix  +  E  (vv  .  Then  by  putting  X-«l  and  X;*0  for  1, 
jel(i)  keK  ^  J 

(2.3.4)  holds.  Hence  S'oP^.  As  i  was  arbitrary,  S  cS'. 

If  X  e  S",  suppose  that  X^j^  ■  1  in  (2.3.4).  Then  X^  *  0  for  j  *  i 
ij  k 

and  sox*£  ♦£  o.v  e  P*.  Thus  x  e  S',  and  so  S  r*  S'. 

jEl(i)  ^  keK  ‘ 

We  have  established  that  S  ■  S',  i.e.  that  (2.3.4)  is  a 

representation  of  S. 

To  see  that  (2.3.4)  is  exact,  only  one  direction  of  the  bi¬ 
conditional  need  be  established  (i.e.  that  if  x  solves  the  linear 
relaxation,  then  x  c  conv(S)). 

Put  I  ■  {i 1 X^  >  0);  we  have  1  *  t.  For  i  c  i,  define 
X^j  *  Without  loss  of  generality,  Xj  >  0.  With 

have: 


E  E  X^.x 
iel  jcl(i) 


E  (IV 
keK 


Only  (n+l)  constraints  appear  in  (2.3.3).  The  linear  relaxation  of 
(2.1.27)  contains  many  more  constraints. 
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The  "common  wisdom"  regarding  the  Simplex  Algorithm,  in  practical 
applications,  is  that  its  running  time  is  linear  in  the  number  of 
constraints,  and  is  affected  by  the  number  of  variables  only  slightly. 
Clearly,  this  practical  observation  is  based  on  experience  where  the 
number  of  variables  is  not  exponential!  For  "arbitrary"  polyhedra  P^, 
the  number  of  extreme  points  x*^'^  (i.e.  the  sire  of  I(i))  is  exponential, 
so  the  exact  representation  (2.3.4)  is  of  no  practical  value. 

However,  a  second  observation  from  experience  is  that,  when 
representations  are  called  for,  the  which  arise  are  not  "arbitrary"  at 
all,  and  have  very  few  extreme  points.  Part  of  the  reason  for  this  is 
that  the  P^  thus  occurring  are  of  small  dimension,  but  that  is  not  the 
entire  reason.  In  any  event,  for  the  common  representations  there  are 
not  many  variables  and  the  (n+1)  linear  inequalities  needed,  assumes 

primary  importance.  As  we  shall  see  in  the  next  section,  the  most 
efficient  known  linear  relaxations  of  common  experience  -  which  have 
often  been  arrived  at  without  a  general  method  or  explicit  mention  of 
the  sharpness  properties  of  (2.3.5)  are  instances  of  (2.3.3). 

Section  4:  Corollaries  and  Applications 

4.1  Relation  to  Earlier  Results.  Section  2  of  this  chapter 
contains  a  definition  of  bounded-MlP-representable  sets  that  is  closely 
related  to  the  definition  used  by  Meyer  (9).  Specifically,  a  function  f 
is  bounded-MIP-representable  if  and  only  if  f  has  a  rational  MIMM  with 
the  bounded-integer  property  of  (2.2.1).  We  will  indicate  in  this 
section  that  our  Theorem  2.2.1  is  a  generalization  of  the  five  main 
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theorems  used  by  Meyer  for  the  one~dimensional  problem,  for  the  case  of 
rational  representability . 

The  following  result  is  Theorem  2.2.1  of  Meyer  [9]  for  the 
rational  case. 

Theorem  2.4.1 

A  function  g(x),  with  a  bounded  non-empty  effective  domain 
contained  in  R,  is  bounded  MlP-representable  if  and  only  if: 

(1)  Tne  effective  domain  is  the  union  of  a  finite 
number  of  closed  and  bounded  intervals; 

(2)  g(x)  is  lower  semi-continuous; 

(3)  g(x)  is  either  identically  -  «  or  finite  and 
piecewise-linear  (with  a  finite  number  of 
segments)  on  its  effective  domain. 

Proof : 

A  function  g(x)  with  bounded  domain  satisfies  the  given  conditions 

if  and  only  if  epi(g)  is  a  finite  union  of  polyhedra  each  with  a  single. 

* 

common  recession  direction,  x  *  (0,1).  By  Theorem  2.2.1,  a  function 
with  bounded  domain  is  bounded  MIP-representable  if  and  only  if  epi(g)  is 
a  finite  union  of  polyhedra  with  sole  recession  direction  (0,1). 

Q.E.D. 

Meyer's  Theorems  2.3.1  and  2.3.2  concerning  semi-infinite  domains 
are  proven  in  a  similar  manner.  However,  one  and  only  one  polyhedron  Pj^ 

in  the  finite  union  epi(g)  *  Pj  ^  ...  have  two  extreme 

* 

recession  directions,  one  of  which  is  x  ■  (0,1),  and  the  other  unique  to 
the  "final"  unbounded  interval.  This  additional  recession  direction 
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serves  to  restrict  the  functions  thus  representable.  For  example,  the 
conditions  of  Theorem  2.3.1  of  Meyer  involve  an  effective  domain  bounded 
from  below.  The  conditions  are: 

(1)  The  effective  domain  is  a  closed  interval; 

(2)  g(x)  is  lower  semi-continuous  and  continuous  from  the  right; 

(3)  g(x)  is  either  identically  -  *  or  finite  and  piecewise-1 inear 
on  its  effective  domain; 

(4)  If  g(x)  is  not  identically  -  •,  then  c.  <  c^^  where  c^^  is  the 
slope  of  the  "final"  (infinite)  interval  and  c^  the  slope  of 
of  all  other  intervals. 

As  before,  our  Theorem  2.2.1  leads  to  the  condition  (3).  The  new 

recession  direction  (1,  c^^)  of  must  be  a  continuous  recession 

direction  of  epi(g)  ■  Pj  |J  the  continuous  nature  of  this 

recession  direction  will  make  (l),  (2),  and  (3)  necessary.  E.g.,  if  g  is 

not  lower-semicontinuous ,  there  is  x^  e  R  and  6  >  0  such  that  for  a 

sequence  6  0*  we  have  g(x°  ±  6  )  <  g(x°)  -  6.  Since  (x°  +  6  ,  g(x°) 

Tn  m  tn 

-  6)  c  epi(f)  ■  Pj  IJ  •  *  •  U  epifg)  is  closed,  we  have 

(x°,  g(x°)  -6)  e  epi(g),  which  is  impossible.  For  a  second  instance,  if 

g  is  not  continuous  from  the  right,  by  lower  semi-continuity  there  exists 
x°and  6  >  0  such  that  for  a  sequence  6  ♦  0  g(x°  ♦  6  )  >  g(x°)  +  6. 

in  TTi 

But  as  epi(g)  has  a  recession  direction  (1,  c„)  we  also  have  g(x°  ■v  6  )  < 

N  w 

g(x°)  +  c„6  ,  and  thus  a  contradiction.  Our  Theorem  2.2.1  can  also  be 
N  m 

directly  applied  to  obtain  the  sufficiency  of  (1)  -  (4). 
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Meyers  Theorem  2.3.2  allows  the  effective  domain  to  be  unbounded 
from  the  left.  Only  condition  (4)  from  above  is  altered  to  read: 

(4)  If  gCx)  is  not  identically  -  •,  then  c^  <  c.  for 

i“2 . .  where  c^  is  the  slope  of  the  "initial  (unbounded 

from  below)  interval. 

Identical  arguments  from  Theorem  2.2.1  also  establish  this  result. 

Theorem  2.4.1  of  [9],  involving  a  domain  which  is  unbounded  in  both 
directions,  amounts  to  juxtaposing  the  two  cases  of  unbounded  domains 
given  above. 

One  advantage  of  our  approach  via  recession  directions,  is  that  it 
indicates  the  generalization  of  these  results  to  functions  of  more  than 
one  variable;  and  the  statement  of  the  result  is  succinct,  rather  than 
involving  a  list  of  conditions. 

4.2  Application  to  Other  Models.  In  this  section  we  give  en 
instance  in  which  the  inequality  description  of  a  function  (2.1.27), 
which  results  in  a  disjunctive  MIP-representation,  becomes  Beale's 
representation  of  a  certain  model  studied  in  [2,  p.  216]  (when  an  extreme 
point  formulation  (2.3.4)  of  the  linear  inequality  system  is  employed). 

Beale's  representation  for  an  approximation  of  a  general  function 

g(x,y)  ■  xf(y),  where  f(y)  is  nonlinear,  is  equivalent  to  (see  I3]): 

2  s 

z  >  L  r  X. .  X.  f(Y.)  (2.4.1) 

i-1  j=l  ^ 


2  s 

I  I  X. .  -  1 

i«l  j-i 


I  (X,  X, ,  ♦  X  X  ,) 


2  s 

y  -  r  I  X. .  Y. 
i»l  j«l 


X. .  ♦  X^.  <  X.  for 

»J  2j  j 


where  £  X  *1  and  X  binary, 

j-1  J  J 


where  X,  ■  X  • 

-1  -min 

the  "grid  points 

interval. 

Our  linear  inequality  method  of  representing  the  same  function 
g(x,y)  *  xf(y)  proceeds  in  two  steps.  First,  we  isolate  the  following 
function  as  a  "piecewise-linear"  approximation  of  g: 


g(x,y)  -  xf(Yj)  if  y  -  Yj  and  X^-^^  ^  ^ 


(2.4.2) 


xf(Y  )  if  y  -  Y  and  X„-  <  x  <  JL^v 

-s  ^  ~s  -min  "max 

Next  note  that,  for  each  j«l . s  the  set 

Pj  •  {(*.Xiy)|x  >  xf(Yj),  <  x  <  y  -  Yj  is  a  polyhedron  with 
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(1,0)  as  its  sole  recession  direction,  and  that  epi(g)  ■  Pj 

aw 

Using  (2.1.27)  to  represent  epi(g),  we  obtain: 


s 

I  X.r. 

j-1  J  J 


s 

I  X.x, 

j-1  J  J 


s 

I  X.Y. 
J-J 


..  .  .jt.Y.) 


X  .  X.  <  X  <  X  X. 
-min  j  -max  j 


I  X.  »  1  ,  X.  integer, 

j-1  J  J 


An  extreme  point  representation  of  (2.4.2)  alters  expression  (2.4.2)  as 
follows.  Since  the  X.  are  binary  variables,  and  £  X.  -  1,  there  are 


continuous  variables  Xjj  and  X2j  with  Xj  -  X^j  X2j  and  x^  *  Xj^ 


X, .  X  .  We  have: 
2j  -max 
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which  is  identical  to  (2.4.1).  An  important  note  for  applications  is 
that  the  specific  representation  used  by  Beale  contains  more  variables, 
but  fewer  constraints  than  the  linear  inequality  formulation.  Thus  for 
large  problems,  Beale's  technique  is  better  suited  for  L.P.  pivoting.  As 
a  general  rule,  we  can  always  use  the  extreme  point  representation  of  the 
polyhedra  in  (2.1.27)  in  place  of  the  linear  inequality 
representation,  if  that  is  advantageous. 

2.4.3.  Sharpness  in  Separable  ProgrammxnE .  an  Application 
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linear  function  gCx^).  Clearly,  our  definition  of  sharpness  for  each 
approximrtion  g  (see  2.3.2)  and  text)  is  iden-ical  to 


epiig^}  «  conv{epi(g£)), 


(2.4.4) 


where  is  the  function  derived  from  the  LP  relaxation  ot  the  MIP- 
representation  of  g^(x). 

We  next  show  that  an  approximation  of  gj(x),  defined  on  a  bounded 
interval  T,  may  be  constructed  with  a  finite  union  of  polytopes,  which  is 
MIP-representable.  We  then  adapt  the  disjunctive  MlP-representation  used 
earlier  in  this  thesis  (see  2.1.27)  and  (2.3.4). 

Proposition  2.4.2 

Let  g(t)  be  a  piecewise  linear  function  of  one  variable  t  on  a 
closed,  bounded  interval  T.  Suppose  that  g(t)  is  linear  on  each  interval 

[Cj_j,  Cj)  (k“l . s),  where  the  intervals  are  disjoint  except  for  end 

points,  and  have  union  T.  For  each  k,  define  the  polyhedron 


P^  •  {(z,t)  E  R  |t  e  [cj_j,  Cj],  z  >  g(t)}. 


(2.4.5) 


Then  (2.1.27)  and  (2.3.4)  both  define  a  sharp  representation  of  g. 

Proof; 

This  follows  from  Proposition  2.3.3,  since  (1,0)  is  the  common  and 
only  direction  of  recession  of  the  polyhedra  P^^. 
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Q.E.D. 

We  illustrate  the  function  g  in  Proposition  2.4.2  in  fig.  2.1. 

Here  a.  is  the  slope  of  g(t)  within  the  jth  interval,  L.  »  fc.  ,,  c.l, 

J  re.  j  J  J“1  J 

and  bj  ■  g(cj),  the  end  point  values  of  gft).  Using  the  matrix  notation 
from  lemma  2.1.6,  we  describe  a  polyhedral  union  description  of  g(tl  as: 


s 

and  enter  *  ■  Z  (X.b.  ,  a.t.)  in  the  minimizing  criterion  function 

j.j  J  J-l  J  J 

wherever  g(t)  is  needed. 

We  next  shall  adapt  the  "extreme  point"  description  to  our 
piecewise-linear  example  (fig.  2.4.1)  to  develop  an  MIP-representation 
that  also  remains  "sharp"  after  binary  variables  are  arbitrated. 


The  extreme  points  of  P^  in  (2.4.5)  are  ^i'^' 

The  sole  extreme  ray  (1,0)  can  be  omitted  in  (2.3.4),  since  a  minimum 
value  of  z  is  sought.  Then  the  linear  relaxation  (2.3.5)  of  (2.3.4) 


becomes: 


\  V  * 


*-l  2 

I  I  (X.,c.  +  A  c.  ,) 
i-1  j-1  ‘  "2  1*1 


(2. A. 9) 


5-1  2 

1  «  I  IX.,  ,  all  X.  ,  >  0 
i«l  j«i  ^2 


8-1  2 

where  z  ■  I  I  (X.  b.  +  X..„b.  )  is  entered  in  the  criteron  function, 

i.l  j-i  11  1  12  1+1 

Actually,  a  small  simplification  enters  in  this  case,  if  we  set 

0,  ■  ..  6  ‘  ^  and  e.  •  X.  .  ♦  X.  ,  if  1  <  i  <  s.  Then  (2.4.9) 

1  11  s  s-1,2  1  1*1,2  1,1 

becomes : 


X  ■  I  6.C , 
i»l  ^  ^ 


1  -  £  G.  ,  all  e.  >  0 

i-1  ^  ‘ 


and  z  -  I  G.b.  is  entered  in  the  criterion  function.  This 
i»l  *■ 

simplification  occurs  because  the  right  most  point  of  one  interval  is 


also  the  leftmost  point  of  the  next  interval.  If  there  were  a  "gap  in 
the  domain,  our  method  would  still  work,  but  the  simplification  would 


The  system  (2.4.10)  is  the  most  compact  relaxation  of  a 
formulation  that  is  known  for  separable  programming,  and  is  given  in  |2l. 
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It  involves  only  ttra  constraints,  and  a  number  of  variables  6  equal  to 
the  number  of  intervals. 

From  Proposition  2.3.3,  the  system  (2.4.10)  is  exactly  a 
formulation  for  g  ,  Thus  if  g  is  convex  (i.e.  has  convex  epigraph),  so 
that  g  *  g,  the  system  (2e4.10)  is  a  formulation  of  g,  as  noted  in  [2l« 

aw 

When  g  is  not  convex,  [2]  recommends  a  pivot  rule  which  cannot  always  be 
used  to  optimality.  Alternatively,  one  can  still  solve  (2.4.10)  and 
gather  what  information  is  possible  via  this  "best  possible"  linear 
relaxation. 

2.4.4  New  Representations.  At  this  time,  the  best  of  the  existing 
representations  for  functions  of  one  variable  have  nice  properties.  A 
primary  applications  area  for  our  techniques  is  to  functions  of  several 
variables. 

We  focus  on  a  useful  function  of  two  variables  below,  and  use 
rectangular  domains.  This  use  of  rectangular  domains  can  have 
limitations,  depending  on  the  application,  if  many  variables  are  to  be 
accommodated  (due  to  the  growth  in  extreme  points).  However,  that 
difficulty  need  not  arise  if  simplicial  domains  can  be  profitably  used. 
Our  example  to  follow  is  intended  simply  to  illustrate  our  approach. 

If  activity  i  is  employed,  a  fixed  charge  f^  is  incurred  (i«l  or 
2).  If  both  are  employed,  a  fixed  charge  f^  is  incurred,  and  need  not 
be  the  sum  fj  ♦  f^.  Specifically,  we  are  to  model  the  fixed  charge 
function: 


57 


0  ,  if  •  0; 

g(Xj,X2)  *  fj,  xf  0  <  Xj  <  x^  •  0;  (2.4.11) 

if  Xj  -  0,  0  <  x^  <  M^; 

f  ,  if  0  <  X,  <  M, ,  0  <  X-  <  M„; 
b  11  2.  i 

V. 

The  bounds  and  will  be  necessary,  as  they  were  for  the  one¬ 
dimensional  fixed-charge  problem.  As  we  chose  to  avoid  unbounded  integer 
variables,  we  seek  a  bounded  integer  representation. 

It  can  be  shown  that,  for  a  representation  to  exist  for  (2.4.11), 
we  need  these  conditions  satisfied: 

0  <  f,,  0  <  f-,  max{f,,f^}  <  f^  (2.4.12) 

12  1  2  D 

In  fact,  if  (2.4.12)  fails  epi(g)  is  not  closed  (see  Proposition  1.10). 
The  conditions  "0  <  f^"  simply  require  that  we  have  a  true  fixed-charged 
(rather  than  fixed-benefit)  problem.  Also,  "max{fj,  f2^  ^  always 

be  arranged.  (If  it  at  first  appears  that  f2  >  f^>  always  set  up  for 
both  activities  when  you  set-up  for  activity  two,  and  use  f2  “ 

When  (2.4.12)  holds,  we  have 

epi(g)  »  PpU  PjU  P2^  **3  (2.4.13) 

where  P^  ■  {(z,0,0)  |  z  >  0),  P^  ■  {(z,x^,0)  |  r  >  and  0  < 

P2  ■  {(z,0,X2)  1  z  >  (2,  0  <  X2  <  M2},  and  P^  »  {(z,Xj,X2)  1  z  > 

0  ‘  ‘  "l-  0  ‘  -2  ‘"2>- 

We  shall  investigate  the  linear  relaxation  (2.3.5);  and  as  above. 
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since  a  minimum  value  of  z  is  sought  we  can  omit  the  common  recession 
direction  (1,0,0). 

We  obtain  the  following  for  (2.3.3): 


’‘l  “  ^^2  *  ^32  ^ 


(2.4.14) 


X,  -  (X,,  ♦  X„  ♦  X,^  )  M 
2  22  32  34  2 


2  2  4 

*  £>>,.+  ^  K.  *  Z  X  -  1 

0)  j»l  j*l 


all  X. .  >  0 

IJ 


vh.r.  .  -  (Xj,  *  .  (Xj,  *  X^j)£j  .  (X3,  .  X33  .  X33  .  X3^)f^  is  to 

be  put  into  the  objective  function. 

To  obtain  (2.4.14),  the  following  list  of  extreme  points  were 


used,  with  multipliers  X..  >  0  as  indicated: 

13 


There  are  substantial  simplifications  in  (2.4.14).  For  example,  positive 
values  of  or  deteriorate  (raise)  the  value  of  x,  yet  they 

do  not  occur  in  the  expressions  for  x^  or  x^;  and  also  occurs  nowhere 
except  in  the  last  constraint.  Thus  we  may  take  Xjj  ■  X2J  *  X^j  *  0. 
Moreover,  if  X22  ^  0,  by  decreasing  X22  and  increasing 
retain  feasibility  and  can  only  decrease  z  (since  f^  <  fj^  by  (2.4.12). 
Thus  we  may  take  X22  *  0.  A  similar  argument  allows  the  simplification 
Xjj  *  0.  We  have  thus  eliminated  five  of  the  nine  variables  in  (2.4.14), 
and  X^j  can  eliminated  in  favor  of  an  inequality  convexity  constraint. 
Only  three  variables  remain:  ^j2»  ^22’  ^34'  this  depends  only 

on  the  necessary  conditions  (2.4.12).  We  obtain: 
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*1  “  ^^2  *  (2.4.14) 

*2  '  ^^22  *  S4^”2 

^2  ‘  ‘22  *  Si.  ‘  *■  *"  \j  *  '’• 


where  2 


is  entered  in  the  objective  function. 


22  2  34  b 

It  can  be  shown,  in  the  common  instance  fjj  ^  fj  +  (2* 

not  have  both  and  X^j  positive  when  2  is  minimi2ed.  By  algebraic 


substitution,  the  case  X 


12 


0  is  equivalent  to  ^  while  the 


case  X22  =  0  is  equivalent  to  <  x^/M^.  The  optimal  value  z(x^,x^) 

of  2  in  the  minimum  can  be  computed  to  be 


z(x, ,x  ) 
1  2 


(f^^-f2)Xj/Mj  ♦  *2^2  (2.4.15) 


flXi/M)  ♦  (f^  -  ij>X2/M2,  if  x^/M^  < 


As  we  are  assuming  ♦  f^  to  obtain  (2.4.15),  we  see  that  always 

z(xpX2)  <  (2’'2^^2’  equality  holding  if  =  f^  ♦  f^. 

Note  that  +  ^2*2^^2  individual  linear 

relaxations  when  joint  effects  are  ignored.  In  the  case  f,  *  f.,  =  f.  *  f 

12b 

that  one  set-up  pays  for  both  activities,  (2.4.15)  gives 


z(xj,X2)  *  f(max{Xj/Mj,  ’'2^^2^^  (2.4.16) 

which  is  a  formula  that  can  be  used  directly  to  replace  the  system 
(4.14).  (This  also  can  be  done  with  (2.4.15.) 


.  A' 


.  Vv<".  ■'• 

UUUUJaAiai 
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In  the  less  usual  case  ^  ^2  ^  special  "additional 

penalty"  is  levied  when  both  activities  are  undertaken,  we  note  that 

(2.4.17) 

Xj/Mj  +  X2/M2  <  1, 

Xj,  x^  >0 

Indeed,  *  *2^^2'  ^34  *  ^  solves  (2.4.14)  and  gives  this 

value,  while  any  increase  in  produces  a  larger  value  of  z  (consider 

A,,  as  a  non-basic  variable,  and  note  that  its  reduced  cost  is  f.  -  f.  - 
34  b  1 

£2  >  0).  If  x^/M^  X2/M2  >  1,  we  must  have  A^^  >  0  in  (2.4,14).  Then 
from  f^  >  fj^  +  f^,  one  can  show  that  A^j  A^j  X^^  *  !•  Upon  solving 
the  resulting  three  linear  equations  for  z  we  obtain: 

z(x,,x  )  -  (f,  +  f.,  -  f,.)  +  (r  -  fjx  /M,  ♦  (f,  -  (2.4.18) 

1’2  12b  b211  bl22 

whenever  ^  *  ’  *l'’*2’  *  ^ 

For  the  expression  (2.4.18),  it  can  be  shown  that  z(x^,X2)  >  f^x^/M^  + 
f^x^/M^  in  the  region  described,  i.e.  the  relaxation  value  is  higher  than 
the  sum  of  the  independent  values  (as  it  should  be). 

The  algebraic  simplifications  performed  mechanically  above  only 
amount  to  removing  extreme  points  of  a  which  is  no  longer  extreme  in 
K  “  conv  (Pq  U  Pj  U  ^2  UPj)«  In  an  easily  visualized  formulation 
(2.4.11)  like  the  present,  geometric  intuition  can  speed  the  process. 

Since  K  is  a  polyhedron,  the  linear  relaxation  z(Xj,X2)  of  g  is 


the  maximum  of  a  finite  set  of  linear  forms.  Here,  more  than  two  forms 
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was  not  needed.  The  expressions  for  zCx^.x^)  obtained  in  this  way  can  be 
used  to  replace  the  system  (2.A.14t),  if  that  is  desired.  The  linear 
forms  involve  are  exhibited  above;  we  leave  the  details  to  the  reader, 
and  note  only  that  this  process,  too,  is  part  of  a  general  procedure. 
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EXPERIMENTAL  RESULTS  ON  THE  NEW  TECHNIQUES 
FOR  INTEGER  PROGRAMMING  FORMULATIONS 

Section  1;  Introduction 

A  widely  accepted  view  in  integer  programming  research  is  that 
progress,  in  utilizing  (mixed')  integer  formulations  to  solve  real-world 
problems,  will  rely  primarily  on  algorithm  advances  (for  general  and 
special  structures)  and  clever  coding  tricks.  In  this  view,  we  are 
perceived  as  already  knowing  how  to  represent  a  real-world  problem  with 
integer  variables;  those  simple  "formulation  techniques"  appear  early  in 
the  subject  and  by  now  are  widespread  in  the  master’s  level  and  even 
undergraduate  level  textbooks,  (see  e.g.  |7))  However,  since  in  the 
1970 's,  the  experience  of  practitioners  indicates  that  some  major  issues 
of  formulation  have  been  overlooked  (see  e.g.  18J,  [17),  [23],  [24], 
[25]). 

In  this  chapter,  we  are  going  to  provide  furthur  experimental 
results,  which  favor  certain  new  integer  formulations  that  we  introduced 
in  chapter  II.  Our  formulations  are  not  ad  hoc  "practitioners  tricks," 
but  derive  from  a  systematic  study  of  modelling.  These  formulations  are 
either  directly  derived  from,  or  motivated  by,  developments  in  the 
"disjunctive  methods,"  particularly  as  in  [2]  and  [14]  (see  [13]  for  a 
Survey  of  these  methods).  R.  R.  Meyer  [19],  [20],  [21],  [22]  and 
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T.  Ibaraki  fl2]  initiated  the  systematic  study  of  "integer  modellings"; 
see  also  Il5]. 


Ve  caution  that  our  experimental  results  are  preliminary:  the 
problems  studied  are  "small"  by  industrial  standards.  Nevertheless,  the 
advantage  of  the  newer  formulations  does  appear  to  increase  with  problem 
sire.  Also  the  initial  linear  program  solved  is  typically  much  tighter 
than  in  standard  formulations,  a  fact  which  is  code^independent.  We  use 
the  Land  and  Powell  code  because  of  its  accessibility  and  excellent 
documentation.  We  also  run  problems  using  C.  H.  Martin's  BANDBX  code, 
and  CDC's  APEX  IV  mixed-integer  code. 

We  do  not  contend  that  the  newer  formulations  will  necessarily  be 
better,  in  terms  of  CPU  time,  than  those  previous.  They  often  are  not, 
for  example,  on  tiny  problems  (fewer  than  10  binary  variables). 

Moreover,  of  the  potentially  limitless  number  of  different  formulations 
from  the  real  world,  we  study  only  a  few  common  ones.  Nevertheless,  the 
new  modelling  techniques  are  quite  "automatic,"  easy  to  learn  and  to 
apply  (either  by  hand,  or  in  a  "wrap-around"  of  a  standard  MIP  code),  and 
seem  to  markedly  improve  performance  in  some  common  problems. 

The  modelling  techniques  we  have  developed  have  these  properties: 
(1)  They  are  automatic,  and  do  not  depend  on  an  ad  hoc  analysis  of  the 
problem  to  be  modelled;  (2)  The  linear  relaxation  is  optimal  for  the  set 
modelled;  (3)  Under  mild  assumptions,  variable  arbitration  in  branch-and- 
bound  leads  to  a  modelling  of  our  type,  so  that  reformulations  are  net 
needed  in  lower  nodes  of  the  search  tree. 
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For  related  work  on  problem  formulation,  see  [4],  [6],  [17],  [23], 
[24],  [25]. 

Here  is  the  plan  of  this  chapter. 

In  Section  2,  ve  quickly  review  a  few  basic  concepts  and  results 
from  chapter  II,  and  also  introduce  the  idea  of  "modelling  linkage."  We 
explain  the  "sharpness"  concept,  which  is  part  of  what  differentiates  our 
modellings  from  those  previous.  This  is  sufficient  for  the  reader  to  go 
on  to  the  description  of  our  experiments,  and  to  our  experimental 
results.  In  Appendix  A,  we  provide  (what  is  essentially)  a  tutorial  on 
those  results  and  techniques  from  chapter  II  which  are  relevant  here. 

The  appendix  is  optional  reading. 

Section  3  presents  an  experiment  on  sharpness,  which  we  test  in 
the  setting  of  a  multi-divisional  firm  (like  that  considered  by  Dantzig 
and  Wolfe  [6])  where,  moreover,  each  division  has  a  choice  of 
technologies  at  its  disposal.  This  variant  of  "multiple  choice 
constraints"  is  tried,  since  some  hand  calculations  with  these  kinds  of 
constraints  indicate  a  dramatic  lack  of  sharpness,  in  the  formulations 
given  in  most  articles  and  common  textbooks.  Our  experiment  confirms  a 
dramatic  advantage  for  the  newer  formulations. 

Section  4  presents  an  experiment  on  modelling  linkage,  which  we 
test  in  the  setting  of  separable  concave  functions  with  multiple  fixed- 
charges.  Our  experimentation  here  is  more  limited,  but  it  confirms  a 
definite,  but  smaller,  advantage  of  the  newer  modellings. 

Section  3.2:  What  the  Experiments  Test 

We  begin  this  section  with  a  short  summary  of  some  theoretical 


results . 
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A  set  S  of  rationals  in  Q  is  called  bounded-MIP-representable .  if 
there  are  rational  matrices  ,  A^,  A^  and  a  rational  vector  b,  as  well 
as  a  bound  B,  such  that: 


X  e  S  «*  there  are  u,  v  >  0,  with  u  integer 

and  ||u||  <  B,  such  that 

A,x  +  A-U  ♦  A.v  *  b 
12  3 


(3.2.1) 


Functions  f  which  occur  only  in  a  minimization  objective  are  represented 
via  their  epigraph  epi(f)  *  {(z,x)  |  z  >  f(x)};  their  domains  may  be  a 

general  subset  of  Q  .  This  will  then  coincide  with  Meyer's  definition 
of  function  representability  ll8]  in  rationals.  (Caution:  Functions 

occuring  otherwise  may  requite  representation  by  graphs  or  hypergraphs). 

A  more  general  concept  of  MIP  representability  is  given  in  chapter 
II  but  we  do  not  use  it  here.  Moreover,  unlike  some  previous  work  (e.g. 
[19]),  all  our  representations  are  in  the  rational  field,  and  are  of  sets 
of  rationals. 

Here  is  a  general  result  chapter  II,  which  characterizes  bounded- 
MlP-represent ability. 

Theorem:  The  set  S  C  q"  is  bounded-MIP-representable,  if  and  only  if, 

both  these  conditions  hold: 

o)  S  is  a  finite  union  of  polyhedra; 
o 

B)  Whenever  x  e  S  and  y  c  Q  are  such  that 


m-. 
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o 

X  ny  e  S  for  all  n*l,2,3,  then  for 

every  x  c  S  and  all  X  >  0  we  have  x  Xy  c  s. 

By  the  Theorem,  any  finite  union  of  (rational)  polytopes  (bounded 
polyhedra)  is  bounded-MIP-representable,  since  always  y®0  is  forced  when 
the  hypothesis  of  3)  holds,  so  that  8)  is  vacuous.  The  Theorem  also 
shows,  that  the  unbounded  fixed-charge  problem  is  not  bounded-MIP- 
representable.  (Actually,  Meyer  earlier  showed  it  is  not  representable 
at  all).  The  function  for  this  charge  is 


with  c  >  0.  We  have  epi(f)  »  P^  UP2»  |  ^  >  0}* 

?2  “  {(z,x)  I  X  >  0,  2  >  c),  so  that  condition  a)  of  the  Theorem  holds. 

0  0  o 

However,  with  (z  ,  x  )  *  (c,0),  y  •  (0,1),  and  (z,x)  ■  (0,0),  we  find 
that  condition  0)  fails.  This  explains  why  we  need  an  upper  bound  on  x 
to  get  a  representation  for  fixed-charges. 

The  Theorem  "almost"  says,  but  does  not  actually  say,  that  when  S  is 
written  as  a  union  of  polyhedra  S  ■  PjU  P^U  ...U  P^,  then  all  P^  have 
the  same  recession  directions.  For  example,  if  t*2,  Pj  “  {(Xj,  x^)  1 
0<Xj<l,0<X2<l},  Pj"  1  >  1 •  0  <  x^  <  2, },  then  by 

the  Theorem  S  ■  P|  Up2is  representable,  even  though  Pj  has  no  recession 
directions  and  P^  has  (1,0)  as  a  recession  direction.  (Tlie  exact 
"placement"  of  Pj  relative  to  P^  matters;  for  example,  if 
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Pj  ■  {(Xj,X2)  I  -1  <  Xj  <  0,  0  <  x^  <  1 },  then  S  ■  Pj  U  P2  i* 

bounded-MIP-representable) .  However,  if  C  is  the  sum  of  the  recession 

cones  of  the  P.  for  a  bounded-MIP  representable  set,  note  that  S  *  (P. 

1  1 

I 

C)  ...  (P^  ♦  C).  Hence,  "without  loss  of  generality,"  the  “  Pj 

C  can  be  taken  to  have  the  same  recession  directions. 

Note  that  some  important  sets  are  not  bounded-MIP-representable 
(indeed,  not  representable  at  all),  as  for  example  the  "complementarity 
set": 

f(x^,X2)  •  {(Xj,X2)  >  0  I  Xj'x^  *  0}  -  Pj  U  P^,  where 

?!  ■  {(Xj  x^)  >  0  1  Xj*  0}  and  P^  ■  {(x^,X2)  >  0  |  x^  -  0). 

(One  easily  verifies  that  0)  fails).  Meyer  and  Thakker  have  introduced 
other  representability  concepts  for  such  a  set  [22],  the  concept  of 

"polyhedral  union"  representability,  which  will  not  concern  us  here. 

Our  focus  in  this  chapter  is  not  on  the  issue  of  the  existence  of 
representations,  but  on  certain  qualities  they  may  have,  when  they  exist. 
The  first  significant  quality  is  that  of  "sharpness." 

A  specific  bounded  modelling  Aj ,  A2,  A^,  b  for  a  set  S  is  called 
sharp,  if,  in  addition  to  (3.2.1): 

X  c  clconv(S)  -  there  are  u,  v  >  0,  (3.2.3) 

I |u| I  <  B,  with 

AjX  +  A2U  +  AjV  «  b. 


In  (3.2.3),  the  integrality  condition  is  dropped  ("relaxed")  on  u,  and 


clconv(S)  is  the  closure  of  the  convex  span  of  S.  In  chapter  H,  it  is 
shown  (and  it  is  an  easy  result)  that,  whenever  (3.2.1)  holds,  then 

X  e  clconv(S)  ♦  there  are  u,  v  >  0,  l|ulj  C  B,  (3.2.4) 
with 

A)*  +  A^u  +  A^v  -  b. 

Therefore,  a  modelling  is  exactly  sharp  if  its  linear  relaxation  is  as 
"tight"  (i.e.  small)  as  possible. 

In  branch-and-bound  codes,  the  algorithm  uses  the  linear 
relaxation  of  a  mixed  integer  program,  both  for  fathoming  and  for  guiding 
the  search  strategy.  Therefore  one  might  heuristically  conclude  that 
sharp  formulations  should  be  superior  to  nonsharp  ones.  This  is  not  an 
exact  deduction,  since  sharp  modellings  may  contain  a  different  (often, 
but  not  always,  larger)  number  of  variables  and  constraints.  Moreover, 
so  many  heuristic  devices  are  used  in  branch-and-bound,  which  are  not  (in 
some  sense)  "monotone"  with  the  size  of  the  linear  relaxation,  that  an 
exact  analysis  seems  very  difficult.  However,  the  heuristic  principle  is 
clear  enough.  Moreover,  we  discover  that  many  of  the  "textbook 
formulations"  are  not  sharp,  so  an  experiment  seems  in  order. 

The  concept  of  "sharpness"  is  introduced  by  Meyer  (21)  tor 
functions,  with  the  larger  descriptor,  "linear  relaxation  optimal."  In 
chapter  II,  we  provide  sharp  formulations  for  all  bounded-MIP- 
representable  (really,  all  MIP-representable)  sets,  which  also  retain 
sharpness  in  many  (but  not  all)  situations  as  the  variables  arc 
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arbitrated  by  a  branch-and-bound  code.  The  second  property,  which  one 
might  call  "hereditary  sharpness,"  is  important  when  one  cannot  intervene 
in  the  code  at  each  node,  to  "touch  up"  the  representation.  Our 
formulations  are  presented,  with  worked  examples,  in  Appendix  A. 

We  now  wish  to  introduce  the  newer  concept,  of  "modelling  linkage." 

Representations  are  not  used  by  themselves,  but  as  parts  of  larger 
programs.  For  example,  we  may  have  a  program: 


min:  cx 


(3.2.7) 


subject  to: 


Ax  ■  b 


1 

1  1 

1  1 

AjX 

♦ 

A2U 

+  A3V 

2 

2  2 

2  2 

AjX 

♦ 

A2U 

+  A3V 

1  2 

1 

X, 

u 

.  u 

,  V  ,  V 

1 

2  . 

u  ,  u  integer 
Xj  integer,  j  c  J. 

Note  that  the  constraints  of  (3.2.7)  actually  give  a  bounded-MIP- 
represention  of  the  set 

S*{x>0|Ax»b,  xCSj^U  S^,  Xj  integer  for  j  e  j}. 
However,  it  is  important  to  realize  that  this  may  fail  to  be  a  sharp 
presentation  of  S,  even  if  one  begins  with  sharp  representations  of 
and  S2*  In  particular,  the  representation  of  S  obtained  by  our  methods 
(which  give  sharp  representations)  may,  and  often  do,  differ  from  the 
constraints  of  (3.2.7). 

Modelling  linkage  refers  to  the  way  that  the  representations  of 
smaller  side  conditions  (like  x  c  SJ  are  made  independently,  and  then 
simply  "attached"  or  "linked  into"  the  whole  program  (3.2.3).  By 
sharpness  considerations,  one  ideally  would  like  to  represent  all  of  a 
program  at  once  (the  set  S  in  the  example  above),  rather  than  by  "pieces" 
which  are  attached  on.  However,  generally  the  size  of  "large  chunk" 
formulations  can  grow  multiplicatively,  so  some  compromise  is  often 
needed.  (However,  see  later  for  an  instance  where  the  "large  chunk" 
formulation  is  smaller.) 

The  issue  of  modelling  linkage  can  be  a  very  subtle  one,  as,  for 
example,  a  representation  of  S'  “  (x  t  S  |  x^  integer  for  j  c  J)  can  be 
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superior  to  one  of  alone.  It  is  best  clarified  in  a  more  general 
setting  (similar  to  that  of  the  co-propositions  of  |14)).  Modelling 
linkage  is  quite  clearly  related  to  the  issue  of  sharpness,  but  it  goes 
further,  in  that  it  concerns  the  setting  in  which  sharpness  is  sought. 

In  addition  to  the  "sharpness"  aspect  of  modelling  linkage,  there 
is  a  second  subtlety  subsumed  in  this  concept,  which  we  call  "variable 
co-ordintion."  For  example,  if  a  piecewise-linear  function  of  one 
variable,  which  also  has  fixed-charges,  is  to  be  modelled  in  two  separate 
sections  -one  for  the  piecewise-linear  part,  and  one  for  the  fixed 
charges  -  the  separate  modelling  may  not  take  explicit  care  to  insure 
that  the  same  interval  of  function  value  is  being  considered  in  one  part 
as  in  the  other. 

Particularly,  after  some  integer  variables  controlling  segments  of 
the  function  have  been  arbitrated  to  specific  binary  values,  it  is  quite 
possible  that  in  some  subproblems  of  the  branch-and-bound  tree,  different 
segments  are  being  considered  in  the  separate  modellings.  This  fact  need 
not  show  up  as  a  simple  inconsistency  of  the  linear  program  for  that 
subproblem,  but  it  can  greatly  multiply  (unnecessarily)  the  size  of  the 
branch-and-bound  tree.  We  say  in  such  cases  that  the  segment  variables 
are  "unco-ordinated,"  and  such  a  lack  of  co-ordination  would  not  occur  in 
a  simultaneous  modelling  of  both  aspects  of  the  function  together. 

Incidentally,  in  some  cases  variable  co-ordination  can  be  achieved 
among  the  separate  parts  of  the  modelling,  without  going  to  our 
techniques;  but  this  phenomenon  of  co-ordination  seems  not  to  have  been 
discussed  previously.  Variable  co-ordination  refers  primarily  to  the 


hereditary  properties  of  modellings  (i.e.  how  they  behave  as  integer 
variables  are  arbitrated). 
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For  a  preliminary  test  on  modelling  linkage,  and  hence  also 
variable  co-ordination,  we  choose  a  problem  in  separable  programming  of 
the  type  discussed  above.  Each  objective  function  involves  a  single 
variable,  and  is  piecewise-linear,  except  for  "fixed-charges"  (or,  in  the 
economist's  terminology,  "set-up  charges"  -  as  all  our  costs  are  variable 
prior  to  the  decision  made  in  connection  with  the  model). 

We  use  four  formulations;  the  first  two  are  "separate" 
formulations,  one  for  the  fixed  charges  and  one  for  the  piecewise-linear 
segments  ,  which  are  separately  attached  to  the  main  program.  They  are 
similar  to  formulations  found  in  textbooks.  The  third  formulation  is  our 
sharp  formulation  for  the  entire  function  (done  at  one  modelling).  The 
fourth  formulation  is  of  the  "separate"  variety,  and  as  each  separate 
part  comes  from  our  techniques,  each  separate  part  is  both  sharp  and 
hereditarily  sharp.  (One  of  the  textbook-like  formulations  is  not  sharp, 
although  it  did  quite  well;  the  second  is  not  hereditarily  sharp.) 

As  it  turns  out,  the  third  formulation,  which  involves  the  entire 
function,  also  requires  half  as  many  binary  variables  -  an  example  of  how 
a  smaller  program  can  sometimes  have  a  tighter  linear  relaxation!  By  the 
usual  rule  of  thumb,  the  third  formulation  ought  to  run  faster,  and  its 
improved  sharpness  should  increase  the  effect.  However,  we  run  the 
experiment  to  be  certain,  and  it  does  confirm  our  expectations. 

Incidentally,  the  data  of  the  type  used  in  our  second  experiment 
is  such,  that  we  believe  our  favorable  results  are  due  to  the  variable 


We  begin  this  section  by  describing  a  set  of  problems  that  will 
allow  us  to  empirically  compare,  our  modelling  for  either/or  constraints 
with  the  standard  textbook  modelling.  In  our  formulation,  these  mixed 
integer  programs  have  binary  integer  variables  in  special  ordered  sets, 
where  for  each  set  of  binary  variables  at  most  one  is  non-zero.  However, 
we  do  not  incorporate  special-ordered  set  branching. 

The  scenario  for  the  test  problems  involves  a  corporation  which  has 
several  divisions  producing  different  end  products.  Each  division  has 
the  choice  of  different  Lechnolcgies  in  producing  these  products,  and  the 
products  produced  by  individual  divisions  are  different.  The  technology 
chosen  affects  the  total  output  and  product  mix  of  each  division.  Since 
the  corporation  must  satisfy  financial  and  capacity  restriction 
considerations  which  apply  to  all  products  produced,  its  goal  is  to 
simultaneously  select  the  desired  technology  for  each  division  while 
optimizing  corporate  profits. 

The  following  notation  is  used  in  the  problem: 

ND  *  number  of  Divisions. 

NT(i)  *  number  of  Technologies  possible  for  Division  i. 

NP(i)  *  number  of  Products  produced  by  Division  i. 

NC(i,j)  ■  number  of  constraints  in  Division  i  when  using 


Technology  j. 
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■  vector  of  products  produced  in  Division  i, 
of  dimension  NP(i) 

C£  =  contribution  of  Division  i's  products  to  the  overall 

corporate  profit  (a  vector  with  dimension  equal 
to  X . ) 

A.  “a  matrix  of  dimension  (CCx  NP(i))  which  is  the 

X 

effect  of  Division  i's  output  in  constraints. 

CC  =  number  of  common  constraints  which  tie  all  Divisions 

together. 

D. ,  =  A  matrix  of  dimension  (NC(i,j)  x  NP(i))  which 

represents  the  technology  of  Division  i  when 
Technology  j  is  selected. 

r..  ®  The  right-hand-side  constraining  Division  i,  when 

Technology  j  is  chosen. 

All  matrices  A.,  D.,  are  nonnegative  and  with  no  zero  columns.  The 
I  ij 

general  problem  is  depicted  below: 


(MIP) 


max:  c.x,  +  c«X-  +  ...  ♦  c  x 


'1"1  "2  2 


ND  ND 


( Common 

Constraint)  A  x  +  A^x  +  ...  ♦  Ax  <  b 
1^1  2^2  ND^ND 


(Division  1) 


D  x<r  orD  x  <r  or.. 

I.ri  J.I  1,2J  J.2 


•**  ^l.NTd)*!  ^  ll.NTd) 
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(Division  2) 

^2,l’‘2  ^  *^2,1  ^2,2^2  ^  ^2,2  '*• 

...  D2,NT(2)*2  ^  r2^NX(2) 


(Division  ND) 

D  x<r  orD  x<r 

ND.l^ND  ^ND,1  ND,2JJD  ^  ND,2  or  ... 

°ND,NT(ND)1^D  ^  ^ND.NKND) 

X.  >  0  for  all  i“l,  ND 

We  use  two  MIP  Representations  to  model  this  problem.  The  first  is 
a  standard  procedure  found  in  numerous  sources,  such  as  Eppen-Gould  (7]. 
The  second  representation  is  from  chapter  11  on  modelling  a  finite  union 
of  polyhedra. 

Assuming  the  maximum  production  capacity  of  each  division  is  finite 

results  in  each  constraint  set  being  a  bounded  polyhedron,  or  polytope. 

In  order  to  model  the  problem  using  the  standard  method,  the  upper  bound 
★ 

vector  X.  for  variable  x.  must  be  known,  so  that  one  can  calculate  upper 
bounds  u^j  for  each  constraint  (this  procedure  is  explained  in  detail 
later  on).  For  now  we  define  u^-  as  the  value  of  the  r.h.s.  of  each 


constraint  of  Division  i,  and  Technology  j,  when  all  products  of  Division 

* 

i  are  producing  at  this  peak  value 


“  “i.j- 


Thus  the  standard  representation  (MIPS)  is  as  follows. 


78 


°l,NT(l)’‘l  ^“l.NTd)  ~  ’^l.NKD^^l.NTd)  ^  “i.NTCD 
(S.O.S  Constraint)  +  2^  2  *  •••  ■*■  *l,NTd)  “  ^ 


(Division  ND) 


1  X  + 

(u 

-  r 

)z 

<  u 

ND.l^ND 

..ND,! 

..nd. 

1  ND,1 

^nd,i 

'nd,2!!nd 

^iiND,2 

“  ![nd. 

2^^ND,2 

^  “nD,2 

°ND,NT(ND)*ND  ^“nD,NT{ND)  ^  ]^ND,NT(ND)  ^^ND,NT(ND)  ^  ^ND,NT(ND) 
(S.O.S.  Constraint)  ♦  z^^^j  *  •  •  •  *ND,NT(ND)  '  ^ 

(Binary)  z.  .  c  {0,1}  for  all  i,j 

' » J 

For  tlie  MIPS  representation,  the  constraint  and  variable  totals 


ND 

NT(i) 

MIPS  Constraints  • 

CC  ■*■  1 

Z  NC(i,j)  ♦  ND, 

i«l 

ND 

MIPS  Variables 

I  {NP(i) 

♦  NKi)}, 

i»l 


MIPS  Binary 


ND 

I  NT(i). 


The  second  representation  is  designed  specifically  for  representing 
the  finite  union  of  a  set  of  polyhedra  via  integer  variables,  and  is  our 
sharp  representation  from  chapter  II.  For  our  division/technology 
problem,  the  polyhedra  are  actually  bounded  polytopes,  and  the  bounded- 


MIF  representation  becomes  as  follows. 


(MIPP) 


Mx:  c{xj  .  cjx  .....  cj^x 


(Common) 


Ax  ♦Ax  +  ...+A  X  <b 
2^2  ND,^ND 


*1,1  ♦  *1,2  ♦  •••  +  *1,NT(1)  “  *1 


^2,1,  ^2,2  .^2,NT(2)  ^2 


(Division  1) 


*ND,1  *  *ND,2  ]|nD,NT(ND)  “ 

°1. 1*1.1  ■  ll, 1*1.1  ^  ° 


^I’NId) J,NT(1)  ”  Il,NT(l)  *1,NT(1) 


(S.O.S.  Constraint) 


Z  +Z  ♦...♦z  ..v“l 

1,1  1,2  1,NT(1) 


(Division  2) 


°2, 1*2,1  '  12,1*2,1  ^  0 


^2,NT(2)*2,NT(2)  ~  l2,NT(2)*2,NT(2)  ^  ^ 


(S.O.S.  Constraint)  *2,1  *  *2,2  *  •••  *  *2,NT(2)  “  1 


®ND,liND,l  ”  !LnD,1*ND,1  ^  ° 
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Division  ND) 


(S.O.S.  Constraint) 
(Binary  Constraint) 


^ND,NT(ND)*ND,NT(ND)  ”  ^ND,NT(ND)*ND,NT(ND)  ^  ° 

*ND,1  *  *ND,2  *ND,NT(ND)  '  ^ 

*i,j  ® 


Notice  that  the  total  variables  and  constraints  of  the  (MIPP) 
representation  are,  or  can  be,  greater  than  for  (MIPS).  However,  the 
number  of  binary  variables  is  fhe  same  in  both  formulations.  Here  are 


some  counts: 

ND  NT(i) 

(MIPP)  Total  Constraints  *  CC  ♦  {  Z  (I  NC(i,j))  ♦  NP(i)}  ♦  ND 

i*l  j*l 

ND 

(MIPP)  Total  Constraints  *  (MIPS)  Total  Constraints  ♦  Z  NP(i) 

i*l 

ND  NT(i) 

(MIPP)  Total  Variables  •  {I  (I  NP(i))  NT(i)  ♦  HP(i)) 

i-1  j*l 

ND  NT(i) 

(MIPP)  Total  Variables  »  (MIPS)  Total  Variables  *  Z  Z  NP(i) 

i-1  j-1 
ND 

(MIPP)  Binary  Variables  »  (MIPS)  Binary  Variables  -  £  NT(i). 

i-1 

★ 

The  upper  bound  vectors  x.  used  to  determine  the  u-  values  for  the 

standard  representation  (MIPS),  are  data-dependent  on  the  constraint 

coefficients  (D.  ,).  Rather  than  explain  their  calculation,  a  small 
‘iJ 

example  best  decribes  this  computation.  Suppose  Division  1  has  the 


following  situation. 


“l.!  'l  ‘  ‘  ■'l 


d.  ,x,  *  d.  -X-  <  r. 

2,1  1  2,2  2  2 


d'  X  ♦  d'  X  <  r' 

1,1  1  1,2  2  1 


d’  X  ♦  d’  X  <  r' 

2,1  1  2,2  2  2 
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Note  that  d  is  an  element  of  D,  ,  and  d'  ,  is  similarily  an  element  of 

IC|la  *9* 

T  T 

©i  2  Liliewise  fr^,  r^l  ,  and  respectively  are  rj  j  and 

r,  -  respectively  of  MIP. 

*  * 

To  determine  Xj  and  X2  we  compare  the  following  ratios; 

★ 

Xj  *  max{min(r j/dj  j  1  ’  j  !  ^2^ ^'2 

X*  •  max{min(r,/d,_j  ; 

with  all  d.  .  and  dl  .  non-serc.  If  ad.,  or  d!.  is  eero,  the 
i.J  i.J  ij  ij 

corresponding  terms  are  omitted  (recall  that  all  matrices  are 
nonnegative) . 

★ 

We  set  each  x^  to  its  least  upper  integer  bound  (to  keep  within 
integer  data)  and  then  find  u.  .  as  follows. 


u.  .  *  0.  .X.  ,  where  x.  *  T  x.^ 
i,j  i,j  1  '  1  111 

aw'**  av  aw 


In  our  example, 


“1,1  “  ■"  ^‘’l,2^*2 

_“1,2_  •  <‘^2,1^’'**  *  f‘’2.2^*r 

u|  1  ”  “  ^‘^1,1^***  * 

_  “i,2  “  *  ^^2,2^’ 


A  sample  problem  with  3  divisions  and  2  Technologies/Division,  3 
products/Division ,  3  constraints/Technology/Division ,  and  3  common 
constraints  is  included  in  Appendix  B. 


The  idea  behind  our  approach  to  the  setting  of  upper  bounds,  is  that 

•k  4r 

the  quantities  Xj ,  x^ ,  etc.  are  the  best  valid  upper  bounds,  if  we  know 
only  that  one  of  the  alternative  divisional  constraints  hold  (but  we 
don't  know  which  one).  While  the  bounds  for  u,  ,,  u,  ,,  etc.,  may  seem 

extravagant,  actually  they  are  approached  in  the  worst  case. 

Therefore,  our  procedure  represents  a  reasonably  "tight"  bounding 
method,  which  the  user  could  quickly  implement.  The  user  might  obtain 
even  better  bounds  by  running  many  LP's,  etc.,  but  our  method  is  fast  and 
does  not  use  the  large  bounds  which  are  encouraged  by  some  authors  for 
the  standard  formulation. 

It  has  been  observed  in  practice,  that  excessively  large  bounds 
make  the  standard  formulation  virtually  useless  in  algorithms.  Thus  our 
experimental  results  concern  a  different  phenomenon,  since  we  provide 
reasonable  bounds  for  the  standard  formulation.  Moreover,  the  total 
failure  of  sharpness  in  the  usual  formulation  will  occur  even  if  optimal 
upper  bounds  are  used. 

We  look  at  three  criteria  in  evaluating  the  results  of  our 
computations.  We  monitor  the  solution  time  of  each  sample,  the  total 
number  of  nodes  of  the  branch-and-bound  tree  solves  as  LP’s,  and  the 
rat io  of  LP  relaxation  optimum  to  the  actual  mixed  integer  optimum. 

The  solution  time  of  each  test  problem  is  recorded  in  CPU  seconds. 
CPU  time  involves  only  the  actual  computer  time  used  in  solving  the 
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problem.  The  time  recorded  excludes  input  and  output  times  since  these 
are  negligible  on  large  problems. 

The  number  of  nodes  solved  as  LPs  includes  the  actual  number  of 
LP's  solved,  including  the  initial  LP  relaxation.  The  larger  the  number, 
the  further  "down"  the  tree,  or  the  larger  the  branch-and-bound  tree 
becomes  before  an  optimal  solution  is  either  be  found,  or  verified. 

Finally  the  LP  solution  to  optimal  MJP  solution  ratio  measures  the 
accuracy  of  the  initial  LP  relaxation  to  the  actual  MIP  solution.  This 
ratio  may  be  very  essential  for  problems  in  which  only  an  approximate 
solution  is  necessary,  or  for  codes  which  use  different  branching  rules, 
variable  choice  rules,  etc.  than  does  ours. 

Our  test  problems  range  from  a  3  Division,  2  Technologies  per 
Division,  3  common  constraints,  3  constraints  per  Technology,  and  3 
products  per  Division;  to  a  15  Division,  3  Technologies  per  Division 
problem  with  all  other  parameters  the  same.  While  relatively  small, 
these  problems  give  significantly  different  results  for  all  three  test 
criteria  between  (MIPS)  and  (MIPP).  The  largest  representation  contains 
153  constraints,  210  variables  with  45  discrete  (binary)  variables,  of 
which  30  are  declared  binary.  The  number  of  binary  variables  actually 
used  in  our  tests  is  less  than  indicated  in  (MIPS)  and  (MIPP).  This  is 
accomplished  by  designating  one  variable  of  each  Special  Ordered  Set 
constraint  as  continuous.  Clearly  this  does  not  affect  the  optimal 
solution,  but  will  reduce  the  number  of  binary  variables. 

The  cost  coefficients  in  the  objective  function,  and  all 
constraint  coefficients  (for  all  A  and  D  matrices)  are  randomly  generated 
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integers  with  bounds  [0,10].  The  r.h.s.  vectors  for  the  divisional 
constraints  (r.  .)  are  calculated  as. 

r.  .  •  2*(sum  of  row  coefficients)  ♦  1. 

.w*' » J 

The  vector  b  of  the  common  constraints  is  set  to  a  *(sum  of  row 
coefficients),  and  we  use  values  a*  1.1,  1.5,  and  1.9  in  our  study.  For 
the  value  a  ■  1.9,  the  common  constraints  are  virtually  not  binding,  so 
that  the  time  comparisons  become  essentially  a  comparison  of  the 
modelling  techniques. 

RESULTS .  The  following  tables  outline  the  results  of  our  test 
problems.  Table  1  contains  information  concerning  problems  in  which  the 
common  constraint  a  value  is  a*  1.1.  Similarily  Tables  2,  and  3 
respectively  have  a  values  of  1.3  and  1.9,  respectively.  Problems  in 
Table  4  are  run  using  Martin's  BANDBX  code.  For  the  Problem  column,  the 
following  notation  is  employed:  n,  -  n  means  that  there  were  ; 
Divisions,  with  n2  Technologies  per  Division.  All  problems  have  3 
products/Division,  3  common  constraints,  3  Constraints/Technology/ 
Division.  The  column  headed  (#)  gives  the  number  of  problems  in  the 
sample  for  this  size;  and  the  column  (#1)  indicates  the  number  in  the 
sample  which  are  solved  by  the  first  linear  program. 

The  results  support  our  expectation  that  the  (MIPP)  representation 
requires  fewer  branch-and-bound  subproblems  (nodes)  before  obtaining  an 
optimal  solution  than  the  (MIPS)  representation.  At  ®  *  l.l,  the  average 
node  advantage  of  the  (MlPP)  representation  ranges  from  2  to  1  for  the 
smallest  sized  problems  (3-2),  to  over  13  to  1  for  problems  (8-3).  We 
run  two  large  problems  (12-3)  and  (15-3)  a  limited  number  of  times  (only 


Table  1.  Multi-Division  Problems 
R.H.S.  Multiplier  (1.1) 


Problem 

# 

Time 

Ml 

[PP 

N0( 

les 

MIPS 

Ratio 

LP /Discrete 

MIPP 

MIPS 

Avg. 

max 

#1 

Avg. 

max . 

in 

MIPP 

MIPS 

3-2 

12 

.625 

0.39 

1.833 

3 

5 

3.75 

7 

1 

1.0005 

1.084 

5-2 

12 

1.86 

2.36 

2.5 

6 

2 

9.917 

19 

0 

1.0047 

1.098 

8-2 

12 

3.579 

11.518 

2.166 

5 

2 

29.08 

65 

0 

1.0007 

1.174 

3-3 

12 

1.638 

0.913 

3.916 

12 

3 

6.67 

13 

1 

1.0048 

1.088 

5-3 

12 

3. 154 

6.933 

3.75 

7 

2 

24.25 

57 

0 

1.00275 

1.102 

8-3 

12 

8.8A7 

33.5 

5.16 

10 

1 

69.25 

165 

0 

1.0008 

1.1336 

12-3 

9 

21.431 

484* 

4.11 

9 

0 

140* 

- 

- 

1.00168 

1.1466** 

15-3 

A 

29.1 

399* 

5 

6 

0 

382* 

•* 

1 .00046 

1.144** 

♦Only  one  sample 

**The  ratio  is  the  LP  over  the  Best  Solution  found 


Table  2.  Multi-Division  Problems 
R.H.S.  Multiplier  (1.3) 


Problem 

i^ 

Time 

M] 

[PP 

Nodes 

MIPS 

Ratio 

LP /Discrete 

MIPP 

MIPS 

Avg. 

max 

in 

Avg. 

max , 

in 

MIPP 

MIPS 

3-2 

18 

0.83 

0.69 

2.2 

4 

6 

4.9 

7 

0 

1.0066 

1.142 

5-2 

18 

1.92 

2.72 

2.3 

4 

3 

9.0 

29 

0 

1.0009 

1.118 

8-2 

18 

5.89 

18.60 

2.7 

7 

3 

?9.6 

91 

0 

1.0013 

1.174 

3-3 

18 

1.69 

1.52 

3.4 

10 

3 

9.7 

17 

0 

1.0032 

1.133 

5-3 

18 

4.20 

11.52 

3.9 

10 

1 

38.2 

118 

0 

1.0011 

1.174 

8-3 

18 

10.99 

123.5 

4.5 

18 

4 

194.7 

597 

0 

1 .0017 

1.116 

12-3 

3 

19.4 

- 

6.3 

9 

0 

- 

- 

1.00008 

- 

15-3 

3 

30.5 

• 

4 

9 

i 

— 

— 

1.00008 

• 
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one  sample  of  each  using  the  (MIPS)  representation)  to  support  our 
premise  that  the  node  advantage  increases  with  problem  size.  We  actually 
stop  the  computer  before  the  (MIPS)  model  is  solved  to  avoid  excessive 
computer  charges. 

At  a  *  1.9  the  nodal  comparison  shows  greater  advantages  for  our 
(MIPP)  representation,  ranging  from  4:1  for  problem  (3-2)  to  over  100-1 
on  problem  (8-3).  Again,  the  single  (MIPS)  models  of  size  (12-3)  and 
(15-3)  are  run  for  5  minutes  of  CPU  run  time  before  we  stop  the  code. 
Recall  that  for  a  =  1.9  the  common  constraints  have  almost  no  effect  at 
all,  and  many  problems  solve  as  LP*s  in  the  newer  (MlPP)  formulation. 

We  are  surprised  at  the  dominance  of  the  (MIPP)  model  with 
respect  to  run  time.  Only  the  smallest  problems  are  run  faster  using  the 
(MIPS)  representation.  At  a  *  1.1,  the  time  advantage  reaches  4-1  for 
problem  (8-3)  and  continues  to  increase  for  the  two  larger  samples  (12- 
3),  (15-3). 

Again  at  a  =  1.3  and  1,9  the  time  advantage  is  even  more 
significant,  reaching  12  to  1,  and  25  to  1  respectively  for  problem  (8- 
3). 

Table  4  summarizes  problems  solved  using  the  BANDBX  code.  The 
problems  are  all  of  size  10  Divisions,  3  Products  per  Division,  3 
Techtiologies  per  Division,  3  constraints  per  Technology,  and  3  common 
constraints.  The  a  value  is  1.3  for  all  problems  in  Table  4.  For  the 
five  problems  run  using  the  BANDBX  code,  our  (MIPP)  formulation  finds  the 
discrete  solution  much  faster  (in  CPU  seconds)  and  more  efficiently  with 
respect  to  branch-and-bound  node  counts.  The  standard  (MIPS) 
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Table  4.  BANDBX  Code  with  RHS  Multiplier  (1.3) 
10  Divisions,  3  Products  per  Division. 


Problem 

(10  Divisions) 

TIME  (CPU  seconds) 

Number  of  Nodes 

lp/discrete 

Ratio 

IPP 

m; 

[PS 

m; 

fPP  1 

m; 

[PS 

LP 

Total 

Lp 

Total 

TO 

Find  j 

Total 

To  1 
Find 

Total 

MIPp 

MIPS 

1 

36.95 

161.8 

2.29 

300* 

8 

222 

325* 

1 .006 

1.34** 

2 

46.37 

1 

54.6 

1.84 

258.1 

14 

261 

1 .0008 

1.12 

3 

120.4 

3.13 

300* 

240 

360* 

1 .003 

1.27** 

4 

41.4 

41.4 

2.29 

300* 

1 

I 

71 

298* 

1 

1.28** 

5 

40.0 

40.0 

2.6 

300* 

1 

1 

52 

324* 

1 

1.28** 

*did  not  solve  the  problem  before  computer  shutoff 

**ratio  is  the  LP  over  the  Best  Discrete  Solution  found 
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formulation,  as  Tabic  4  ahows,  solves  only  one  of  the  five  samples  within 
our  5  minute  CPU  time  limit.  Incidently,  we  continue  to  run  problem  #1 
for  10  CPU  minutes  without  verifying  an  integer  optimum.  Because  of  the 
time  limit  imposed,  it  is  not  possible  to  compare  exact  solution  time  or 
node  counts  between  the  two  formulations.  Even  so,  comparing  the  number 
of  branch-and-bound  nodes  solved  favors  our  (MIPP)  formulation  by  at 
least  30-1  for  the  worst  case  sample.  This  worst  case  result  occurs  in 
sample  #1  in  which  the  (MIPS)  LP/DISCRETE  ratio  is  very  high  (1.34).  The 
significance  of  the  (LP/DISCRETE)  ratio  is  also  evident  in  that  our 
(MIPP)  formulation  results  in  an  LP  relaxation  solution  within  1  percent 
of  the  optimal  discrete  solution.  On  the  other  hand,  the  best 
(LP/DISCRETE)  ratio  of  the  (MIPS)  formulation  is  1.12,  which  represents  a 
12  percent  error.  Incidently,  the  only  problem  solved  using  the  (MIPS) 
formulation  also  has  the  lowest  (LP/DISCRETE)  ratio.  With  the  advantage 
increasing  with  problem  size,  tremendous  time  savings  seem  possible  using 
our  formulation  for  even  small-sized  industrial  problems.  We  caution, 
however,  that  the  problems  in  our  study  are  randomly  generated,  and  do 
not  derive  from  an  actual  industrial  application. 

Of  course,  a  more  efficient  computer  code  would  solve  our  small 
problems  much  faster,  probably  with  fewer  nodes.  But  the  third  test 
statistic,  the  LP  relaxation  solution-to-mixed  integer  optimal  solution 
ratio,  is  invariant  with  respect  to  individual  computer  codes,  heuristics 
used,  and  whether  or  not  a  problem  is  solved  to  proven  optimality. 

For  all  o  values,  our  (MIPP)  representation  has  an  initial  LP 
solution  within  IX  of  the  final  mixed-integer  solution  value.  On  the 
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other  hand,  the  (MIPS)  approach  errors  by  at  least  BZ  on  the  average,  and 
the  error  appears  to  increase  with  problem  size. 

Section  3.4;  The  Experiment  for  Piecewise-Linear  Separable  Programs 
with  Fixed-^Charges 

The  scenario  for  these  test  problems  involves  a  manufacturer  which 
must  meet  the  demand  of  several  separate  categories  of  products.  For 
each  category,  the  company  assembles  up  to  three  end  product  categories, 
each  of  which  meets  the  specification  of  that  category,  and  only  that 
category.  Any  combination  of  end  products  in  a  certain  category  may  be 
used  to  meet  the  demand  requirement  of  that  category. 

For  example,  a  computer  may  be  designed  to  meet  certain 
specifications.  Several  configurations  of  off-the-shelf  intermediate 
products  may  represent  different  end  products,  all  of  which  meet  the 
same  specifications.  In  that  case,  all  the  end  products  fall  into  the 
same  category. 

Each  end  product  is  assembled  from  a  specific  combination  of  inter¬ 
mediate  products.  These  intermediate  products  are  common  to  all  end 
products.  The  intermediate  products  are  high  demand  items  each  with  its 
own  cost  function.  The  goal  of  the  company  is  to  meet  demand  of  each 
category  while  minimizing  its  cost  of  intermediate  products.  We  do  not 
include  assembly  costs,  i.e.  we  take  the  perspective  of  the 
intermediate-products'  manufacturer. 

The  cost  function  of  each  intermediate  product  is  a  separable  linear 
function  with  three  separate  fixed-charges.  For  instance,  the  cost 


function  of  the  i^  intermediate  product  is; 
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■ji  "jv  >■>  vji *> ■'>*>  v.t  . >  •>’..  ->  v  -.-  .c  .- 


MC 


NYt 

*i 

fi(xi) 

NXi 

n(i,j) 


Notation  for  the  problem  is  included  below: 

set  of  similar  end  products  meeting  the  specifications  of 
category  k.  (Sj^|  <  3. 

*  Total  number  of  categories 

*  demand  of  category  k. 

The  j  end  product  made  from  a  specific  combination  of 
intermediate  products. 

*  The  total  number  of  end  products . 

The  1  intermediate  product 

The  cost  function  of  the  i  intermediate  product. 
Represented  by  Eqn  (3.4.1)  and  Figure  (3.1). 

The  total  number  of  intermediate  products. 

Matrix  representation  of  the  combination  of 
required  in  the  assembly  of  each  y,. 

The  problem  is  now  modelled  as  an  optimization  problem: 

Fixed-Charge  Problem 


min: 


s.t. : 
NYt 


NXi 

I  f.(x.) 

i»l  1  1 


(Product  Mix)  (I.)  I  n(i,j)*y.  ■  x.  for  i*l,  NXi 

j»l  J  >• 


(Demand)  (II.) 


I  yi  >  dk 
all  vars.  >  0 


for  k*l,  NC 


(3.4.2) 
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The  variables  in  (3.4.2)  are  not  required  to  be  integrel*  es  it  is 
essumed  that  the  values  of  and  are  sufficiently  large  to  allow 
such  a  continuous  "approxination." 

A  standard  nethod  of  representing  a  fixed  charge  problem  involves 
removing  Che  fixed  charges  and  separately  representing  the  remaining 
function.  The  economies  of  scale  of  each  intermediate  product  results 
in  the  remaining  function  being  a  separable  linear  function  (concave), 
i.e.  Sj^  >  s2^  >  By  for  all  i«l,  KXi.  After  the  remaining  separable 
function  is  represented,  che  fixed  charges  are  represented  as  a  separate 
MIP  representation.  Finally  the  fixed  charge  representation  is 
"added-on"  to  the  separable  representation.  Thus  the  original  fixed 
charge  problem  is  the  combination  of  two  individual  MlP^representations, 
the  models  have  been  "Linked"  together. 

We  use  a  conmvon  standard  procedure  of  representing  the  separable 
function,  which  we  draw  in  Figure  3.2. 


To  model  the  piecewise-linear  function  of  Figure  3.2,  we  insert 
into  the  objective  function  this  term: 


NXi 

I  ♦ 

i-1 


(3. 4. 3. a) 


i  li  li 


*  •2i*“2i  -  »ll>  * 


*  •2i*'*2i-'li>  *  -  “2i'> 


The  following  are  inserted  in  the  constraints: 


♦  X2.*B„.  +  X3.*B,.)  -  0 
1  1  li  1  2i  1  3i 


XO.  ♦  XI.  +  X2.  ♦  X3. 

11X1 


XO.  ♦  XI. 


-01 . 
1 


Xli  ♦  X2i 


01.  +02^  ♦  03^ 


•  1 

>  0 
>  0 

-  1 


01  02^,  03 >  0  integer 


for  i«l,NXi  (3.4. 3. b) 


for  i-1, NXi 


This  modelling  (3.4.3)  is  not  hereditarily  sharp  since  e.g., 
03^  *  0  does  not-  imply  X3£  ■  0.  It  is  simply  one  which  is  similar  to 
those  in  connnon  use. 

We  represent  the  fixed-charge  step  function  using  two  standard 


techniques,  (I)  and  (II).  By  separately  "adding"  (l)  and  (II) 
representations  to  the  representation  in  (3.4.3)  above,  we  have  two 
commonly  used  MIP-representations  of  our  original  problem  (3.4.2). 


V,',‘ 


■j'lQS  V  »Si 


■S*  •>»>  • jny  •>  V  .V  •-'*  *7  ~r:^z  •- 
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For  Fixed  Charge  I,”  we  put  into  the  objective  function  the 

term: 


NXi 

i.l  Oi  1  li  1  2i  i 


and  we  put  into  the  constraints: 


(3. 4. 4. a) 


X.  -  B,.*zO.  <  0 
1  3i  1 

*i  “^®3i  -  <  B 


(3.4.4.b) 


li 


for  i*l,  NXi 


X.  -(B,.  -  B^. )*z2.  <  B  . 
1  3x  2i  1  2i 

*®i*  *^i’  *^i  ^  ®  integer 


When  (Fix-Charge  I)  is  "added”  to  (3.4.3)  we  have  our  first 
standard  MIP  representation,  we  call  the  representation  (STANDARD  I),  or 
just  (l).  Incidentally,  (3.4*4)  is  not  sharp  for  general  bounds  and 
fixed-charges.  The  second  technique  conunonly  used  to  represent  a  step 
function  adds  an  additional  variable  for  each  fixed-charge.  This 
representation  (3.4.5)  is  sharp  and  is  similar  to  the  "6  method" 
contained  in  Bradley,  Hax,  Magnanti  I5).  For  "Fixed-Charge  2,"  we  put 
into  the  objective  the  term: 


NXi 

£ 

i-1 


{f  .*z0.  ♦  f..*zl.  ♦  f  *z2  } 
0i  X  lx  i  2i  i 


(3. 4. 5. a) 


r*  • 
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•nd  tie  add  in  the  constreinte: 


X. 

1 

-  (xl .  ♦  x2.  ♦  x3.) 

1X1 

-  0 

- 

<  0 

x2i 

-  <>2i  - 

<  0 

-  "3i  -  'ai’***! 

<  0 

*li 

-  »U*“l 

>  0 

x2- 

1 

- 

>  0 

zO^, 

,  zl . ,  z2 . 

1  1 

>  0  integer 

(3.A.5.b) 


for  i“l,  NXi 


By  adding  (Fix-Charge  ll)  to  the  (3.4.3),  we  have  the  second 
common  MIP-representation  of  the  original  problem,  which  we  call  this 
(STANDARD  II),  or  (II)  for  short. 

Notice  that  the  (II)  representation  has  the  same  number  of  integer 
variables  as  (I),  but  more  total  constraints  and  continuous  variables. 
Therefore  we  expect  that,  unless  the  branch-and-bound  node  count  of  (ll) 
is  much  less  than  the  count  for  (I),  problems  will  be  solved  faster  using 
the  first  method  (1).  In  fact,  this  turns  out  to  be  the  case,  showing 
that  variable  counts  can  sometimes  overcome  a  deficiency  of  sharpness. 

Note  that  in  both  (I)  and  (ll),  the  fixed-charge  variables  are 
separable  from  the  segment  variables  Oj^,  which  results  (as  we  see  below) 
in  an  unnecessary  doubling  of  the  number  of  binary  variables.  However, 
we  cannot  validly  set,  e.g.  zO^  »  01^,  for  possibly  ■  0  ■  zO^^,  Ql^  ■ 
XO^  ■  1.  This  problem  can  be  overcome  by  an  ad  hoc  device  in  this 
specific  setting,  but  such  devices  go  beyond  what  one  would  do  in  using 
the  standard  modellings  as  they  have  been  described. 


We  next  use  the  extreme  point  form  of  our  sharp  representation  to 
model  the  entire  function  of  Figure  3.1.  Using  our  sharp  rep’-esentation 
there  is  no  need  to  separate  the  fixed-charges  as  before,  and  ve  actually 
require  fewer  integer  variables  and  fewer  total  constraints  and  variables 
than  either  of  the  common  methods. 

We  put  into  the  objective  function: 


I  {XI. *f^.  ♦  X2.*(f  .  ♦  f,.  ♦  ♦ 

X  Ox  X  Ox  lx  lx  lx 

*  *li  *  *21  **li*‘li  *  •2i*“2i  -  ‘li*  * 
112.*., .  122,.(.j,)*(Bj,  -  B,,)  * 


and  add  to  the  constraxnts: 


(III.)  (Xl2j^  ♦  X21^)*Bj^  +  (X22j  ♦  X31.)*B2^ 

♦  A32^*  Bj^  -  x^  for  i-1,  NXi 

(IV.) 

XI.  -  (Xll.  ♦  A12.)  -  0 

XXX 

X2^  -  (X21j  ♦  X22j)  •  0  for  i-1, NXi 
A3.  -  (X31.  ♦  A32.)  -  0 

XXX 

XI.  ♦  X2j  ♦  X3^  <  I 

XI X2^,  X3^  >  0  integer 

This  formulation  is  worked  in  detail  in  Appendix  A. 


(3. 4. 6. a) 


(3.4.6.b) 
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Ve  conclude  vith  •  fourth  modelling  also  involving  the  two 
separate  parts  of  the  function  of  Fig.  3.1.  This  modelling  differs  from 
Standards  I  and  II,  in  that  both  parts  are  separately  done  by  our 
"extreme  point  representation"  (as  described  in  Appendix  A),  so  each  part 
separately  exhibits  sharpness  and  hereditary  sharpness.  This  fourth 
modelling  is  called  "SEMI-SHARP"  (for  lack  of  a  better  name!),  and  it  is 
in  the  comparison  between  SHARP  and  SEMI-SHARP  that  we  isolate  the  issue 
of  modelling  linkage.  The  other  comparisons  provide  information  which, 
while  involving  linkage,  also  are  compounded  with  other  effects. 

SEMI-SHARP  uses  this  formulation  for  the  piecewise-linear  part  of 
the  function: 


NXi 

I  {X12.  ♦  X2.  +  A3.s,.*B,. 

1  1  X  li  li 


(3.4.7..) 


is  added  into  the  objective  function,  while  these  relations  are  added  as 
constraints  for  i*l  ,...,  HXi: 


X. 

1 


(X12.  ♦  X21.)B,.  ♦  (X22.  ♦  X31.)B,.  ♦  X32.B,. 
X  1  lx  X  X  2i  X  3x 


(3.4.7.b> 


XU  . 

♦  X12. 

«  XI. 

X 

X 

X 

X21 . 

♦  X22. 

-  X2. 

X 

X 

X 

X31i 

X32i  ■ 

100 


-  I 

Xli,  X2^ ,  X3^  are  integer 


Also,  all  variables  in  (3.4.7.b)  are  nonnegative. 

SEMI-SHARP  uses  this  formulation  for  the  fixed-cost  part  of  the 
function: 


jf,  ‘  .  £,j) 

‘  "  'li  *  f2i’> 


(3. 4. 8. a) 


is  added  into  the  objective  function,  while  these  relations  are  added  as 
constraints  for  all  i: 


«  (012^  ♦  +  (022^  ♦  ®^*i^®2i  ®^^i®3i  n.4.8.b) 

Oil .  +  012.  -  01  . 

1  IX 

021.  ♦  022.  *  02. 

XXX 

031.  ♦  032.  -  03. 

X  XX 

01^  ♦  02^  +  03^  <  1 
01.,  02.,  03.  integer 

1  X  X 

Also,  all  variables  in  (3.4.8.b)  are  nonnegative. 

For  this  problem,  the  sharp  representation  requires  one-half  of  the 
integer  variables  than  those  required  by  either  of  the  standard 
representations. 
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While  the  primary  goal  of  these  experiments  involves  "modelling 
linkage"  comparisons,  we  also  compare  a  heuristic  commonly  used  in 
applications  of  mixed  integer  programming.  This  heuristic  attempts  to 
reduce  the  number  of  branch-and-*bound  nodes  solved  in  verifying  a 
feasible  mixed  integer  solution  within  2Z  of  the  optimal  solution.  The 
heuristic  fathoms  a  branch-and-bound  limb  if  the  current  problem  solution 
is  either  infeasible,  integer-feasible,  o£  within  2X  of  the  best  known 
mixed  integer  solution. 

Several  problems  are  tested  using  three  different  mixed  integer 
programming  codes;  Land-Powell,  C.  H.  Martin's  BANDBX,  and  CDC's  APEX  IV. 
Test  Problems 

The  test  problems  generated  have  the  following  parameter 
characteristics: 

NXi  •  5,  6,  10 
NYt  •  2*NXi 
NC  -  lNYt/3] 


[100,  200]  integer,  random 
n(i,j)  e  10,  51  integer,  random 


s  .  t  [1,5]  real,  random 
li 


RES  -  0.8 


Sj  -  {1,2,3} 
Sj  •  {^.5,6} 
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Sj^  •  {remaining  end  products) 

NC 

B  ■  I  (d  /|s  |)  *  (  I  n(i,j)),  integer 
k-1  ^  jcS^ 

Bj.  ((1/2  *  B^.)! 

•3i  ■  ^*^2i 
*0i  ■ 

*U  *  •2i*‘'2i  -  »li» 

«2i  -  IP/(1-P)*  -  Bji)J 

P  -  0.1,  0.5 

For  our  problems  "p"  represents  the  percentage  of  total  cost  which 

is  a  fixed'Charge.  In  other  words,  a  low  value  of  "p”  results  in  the 

fixed-charges  being  a  relatively  small  portion  of  the  total  cost  of 

each  intermediate  product.  Our  calculations  are  based  upon  the  following 

principle.  When  the  amount  of  x.  used  is  at  either  B. . ,  B,...  or  B  . . 

1  li’  2i’  3i’ 

then  "p”  represents  the  actual  fixed-charge  percentage  of  the  total  cost. 
Clearly,  at  any  point  other  than  the  three  above,  and  x^*0,  the  fixed- 
charge  percentage  of  total  cost  will  be  greater  than  "p”  so  long  as  Sj^, 
S2£and  s^£  are  strictly  greater  than  zero. 

To  ensure  each  intermediate  product  is  utilized,  we  eliminate  zero 
columns  from  matrix  n.  Notice  that  the  calculation  of  B  .  insures  a 

2i 

feasible  solution  with  each  x^  at,  or  near,  B^^. 
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We  run  problems  with  as  many  as  15  intermediate  products  (NXi*15). 
While  not  large  problems,  they  are  very  difficult  to  solve. 

RESULTS; 

Practice  has  sho%m  the  fixed-charge  problem  to  be  an  extremely 
troublesome  problem  to  solve  using  the  branch-and-bound  technique. 

Through  our  sharpness  property,  we  know  that  the  difficulty  is  caused  by 
the  inherently  poor  "relaxed”  approximation,  even  using  the  best  possible 
"sharp"  representation.  Our  test  problems  show  that  the  problems  remain 
difficult  even  using  our  sharp  representation,  and  also  that  there 
remains  an  advantage  to  these  representations.  Even  the  "best  possible" 
linear  relaxation  is  evidently  fairly  inaccurate  as  an  approximation  to 
the  fixed-charge  functions. 

Tables  6-9  represent  samples  run  using  Land-Powell 's  branch-and- 
bound  Mixed  Integer  code.  Problems  of  Tables  10  and  II  were  run  using 
Martin's  BANDBX  code,  and  CDC's  APEX  IV  was  used  for  the  problems  in 
Table  12. 

Our  first  experiment  is  reported  in  Table  6,  and  does  not  involve 


the  SEMI-SHARP  modelling. 


OD  OB  OD 


# 

SAMPLES 


5 


TIME 


# 

NODES 


DISCRETE/ 
LP  RATIO 
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Table  6  shows  that  small  problems  are  difficult  to  solve,  and  that 
^ur  combined  representation  is  superior  in  all  three  categories.  We  also 
notice  that,  even  though  the  problems  with  small  fixed-charges  (p«0.1) 
have  more  accurate  initial  LP  values  (22%  versus  36T  using  our  combined 
model),  the  problems  are  actually  harder  to  solve  than  those  with 
significant  (p  ■  0.5)  fixed-charges. 

The  degree  of  fixed-charges  does  not  appear  to  affect  the 
computational  advantage  of  our  combined  method  Cw.r.t.  nodes  and  time). 

Aside  from  the  indication  that  the  combined  representation  is 
superior  to  either  standard  method,  it  is  interesting  to  note  that  the 
Standard  II  representation  appears  to  give  sharp  initial  LP  relaxation 
solutions.  The  Discrete/L.P.  ratios  are  identical  to  those  of  our  sharp 
representation  (the  L.P.  solutions  are  identical).  But  after  starting  at 
a  much  closer  solution  than  Standard  I,  the  total  nodes  required  is 
virtually  the  same  for  both  I  and  II.  Furthermore,  even  if  the  initial 
(II)  solution  is  sharp,  it  requires  more  nodes  than  our  sharp 
representation. 

In  our  second  experiment  for  these  fixed-charge  problems,  we 
retain  the  better  of  the  two  "textbook- like"  formulations  from  the  first 
experiment  (this  was  STANDARD  I),  and  run  it  together  with  our  SHARP  and 
SEMI-SHARP  representations.  We  also  gather  more  data  than  before.  The 
results  are  reported  in  Tables  7,  8,  and  9. 


Table  7.  Seven  Problems  with  p*0.3,  NXi*5 
Using  Land-Powell . 


Formulation 

Avg.  Time 
to  LP 

Avg.  Time 
to  Find 
Optimum 

Total 

Time 

Number 
of  Nodes 

Discrete/LP 

SHARP 

1.3  sec. 

14.3 

18.7 

63.3 

1.26 

I 

2.0 

8.7 

41.3 

84.4 

1.50 

SEMI-SHARP 

3.8 

30.0 

60.9 

79.3 

1.26 

Table  8.  Problems  with  p*0.3,  llXi»6, 
Using  Land-Powell. 


Formulation 

Avg.  Time 
to  LP 

Avg.  Time 
to  Find 
Opt imum 

Total 

Time 

Number 
of  Nodes 

Discrete/LP 

SHARP 

2«0  sec. 

22.6 

36.5 

94.8 

1.32 

I 

3.0 

40.3 

120.2 

176.6 

1.56 

SEMI-SHARP 

5.5 

25.2 

129.2 

119.2 

1.32 

Table  9.  A  Hard  Problem,  p“0.3,  NXi*6, 
Using  Pand-Powell. 
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In  Table  7,  it  is  interesting  to  note  that  STANDARD  1  actually 
outperforms  SEMI-SHARP  in  terms  of  the  "Total  Time"  criterion  (as  well  as 
the  average  time-to-find  an  optimum).  However,  as  the  number  of  binary 
variables  increase  from  30  to  36  (NXi  -  5  to  NXi  •  6),  the  gap  in 
performance  noticeably  diminishes.  He  conjecture  that  these  problems 
are  simply  too  small  for  the  SEMI-SHARP  modelling  to  have  an  advantage. 

The  SHARP  modelling  is  superior  in  the  "Total  Time"  criterion, 
although  for  the  set  of  smaller  problems  (NXi*5)  it  is  somewhat  slower  in 
locating  a  optimum.  The  SHARP  modelling  also  has  somewhat  fewer  nodes  in 
the  search  tree,  but  this  relative  advantage  is  not  of  consequence.  Of 
course,  the  SHARP  modelling  has  a  better  Discrete/LP  ratio,  but  it  is  far 
too  large  to  permit  heuristic  solutions  at  reasonable  levels  of  error  CO¬ 
AX)  for  industrial  problems. 

A  harder  problem  with  NXi*6  is  separately  reported  in  Table  9.  It 
is  solved  to  optimality  only  by  the  SHARP  formulation,  although  all  three 
formulations  do  find  the  optimum. 

Tables  10,  11,  12,  and  13  compare  our  SHARP  and  SEMI-SHARP 
formulations.  Table  10  compares  three  problems  of  sise  NXi*3  using 
Martin's  BANDBX  code.  The  results  in  Table  10  indicate  the  dominance  of 
our  hereditarily  SHARP  formulation  over  the  SEMI-SHARP  formulation.  The 
SHARP  formulation  is  at  least  3  times  more  efficient  in  node  counts  and  8 
times  faster  in  total  CPU  time. 

Tables  11,  12,  and  13  represent  the  larger  (NXi"10,  NXi*lS) 
samples.  Table  11  contains  the  results  using  our  "2X  Heuristic,"  Table 


'9 
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Table  10.  Three  Problems  With  NXi^S* 
Using  BANDBX. 


Time  (CPU  ae( 

.onds) 

Modes 

To 

find 

To  Find 

Discrete/LP 

Problem 

LP 

Discrete 

Total 

Best  Dscrete 

Total 

Ratio 

1  SHARP 

0.55 

3.19 

6.82 

15 

25 

1.36 

SEMI-SHARP 

1.68 

31.2 

5*.4 

56 

78 

1.38 

2  SHARP 

0.58 

7.05 

10.87 

26 

38 

1.35 

SEMI-SHARP 

1.58 

49.7 

95.87 

81 

133 

1.35 

3  SHARP 

0.57 

1.74 

4.67 

10 

19 

1.23 

SEMI-SHARP 

1.64 

59.49 

126.32 

82 

155 

1.23 

Table  11.  Six  Problems  With  NXi-10 

With  2%  Heuristic,  using  BANDBX. 


Time  (CPU  seconds) 


Nodes 


Problem 

1  SHARP 
SEMI- SHARP 

2  SH^RP 
SEMI-SHARP 

3  SHARP 
SEMI-SHARP 

4  SHARP 
SEMI-SHARP 

5  SHARP 
SEMI-SHARP 

6  SHARP 
SEMI-SHARP 


Discrete/LP 


To  Find 

Total 

To  Find 

Total 

Ratio 

52.32 

118.82 

69 

128 

1.33 

380.6 

600* 

177 

241* 

1.33 

24.2 

400* 

3b 

357* 

1.31 

405.3 

600* 

192 

268* 

1.29 

36.6 

267.6 

48 

229 

1.26 

246.0 

600* 

123 

246* 

1.26 

22.9 

202.8 

30 

186 

1.29 

— 

600* 

— 

No  Discrete 

40.28 

305.0 

51 

276 

1.31 

346.1 

600* 

131 

197* 

1.31 

9.46 

505.5 

16 

429 

,  1.27 

341.3 

600* 

116 

183* 

1.26 

*The  Branch-and-bound  Algorithm  automatically  stops  after  the  cpu  time 
indicated  to  avoid  excessive  computer  changes.  Thus,  Total  Time  and 
Total  Node  columns  summarise  the  conditions  as  the  algorithm  stopped. 
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Table  12.  Six  Problems  With  NXi*10 

Exact  Algorithm,  Using  BANDBX. 


i 


Problem 

Time  (CPU 

To  Find 

seconds) 

Total 

Nodes 

To  Find 

Total 

Discrete/LP 

Ratio 

1  SHARP 

57.37 

128.99 

74 

139 

1.33 

SEMI-SHARP 

544.2 

600* 

237 

251* 

1.33 

2  SHARP 

I 

32.7 

600* 

42 

427* 

1.31 

SEMI-SHARP 

407.3 

600* 

192 

244* 

1.29 

3  SHARP 

37.6 

315.5 

48 

274 

1.26 

SEMI-SHARP 

248.7 

1  600* 

123 

243* 

1.26 

4  SHARP 

28.83 

274.5 

i  30 

1  214 

1  1.29 

SEMI-SHARP 

528.1 

600* 

241 

262* 

1.31 

5  SHARP 

90.48 

346.4 

82 

266 

1.30 

SEMI-SHARP 

369.12 

600* 

131 

243* 

1.30 

6  SHARP 

22.12 

418.1 

26 

310 

1.25 

SEMI-SHARP 

330.8 

600* 

135 

228* 

1.25 

♦(see  previous  page) 


*  •  *.•  '  • 


A'.**\N'k'' 
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12  contains  identical  problems  run  using  an  exact  algorithm.  To  read  the 
table,  notice  that  the  column  entitled  "Best  Discrete  Solution  to  :nd"  may 
not  be  the  exact  optimal  solution  of  the  problem.  The  22  heuristic  may, 
and  often  times  does  fathom  a  limb  of  the  branch-and-bound  tree  belore 
the  actual  exact  optimum  is  found.  Correspondingly,  the  "NODES  TO  FIND" 
column  indicates  the  number  of  nodes  required  to  find  the  best  discrete 
solution  found.  The  most  prominent  result  of  the  samples  is  that  the 
SEMI-SHARP  formulation  is  unable  to  solve  any  of  the  samples  to 
optimality,  or  even  to  within  22  of  optimality  (using  the  22  heuristic) 
within  10  CPU  minutes  of  computer  time.  The  next  roost  prominent  feature 
is  the  difficulty  of  the  problems  even  for  the  SHARP  formulation.  Notice 
the  high  (LP/DISCRETE)  ratios.  (We  look  at  the  relationship  between  the 
LP  relaxation  proximity  to  the  Discrete  solution  and  problem  difficulty 
in  the  next  chapter). 

We  also  find  that  the  22  heuristic  has  a  very  minor  effect  upon 
problem  solution.  While  finding  the  best  available  discrete  solution 
faster  and  more  efficiently  than  the  exact  algorithm,  the  heuristic 
algorithm  does  not  always  terminate  as  fast  or  as  efficiently.  Detailed 
computer  printouts  indicate  that  when  the  heuristic  causes  a  branch-and- 
bound  limb  to  fathom  before  reaching  the  exact  optimum  (problems  !!‘2,  3, 
6),  the  algorithm  may  search  more  nodes  before  terminating  with  a  soluton 
guaranteed  to  be  within  22  of  the  exact  optimum. 

Table  13  summarises  three  problems  run  to  exact  optimality  using 
APEX  IV.  The  first  problem  is  identical  to  problem  l>6  of  Table  !1  and 
Table  12.  Notice  that  the  times  are  actually  APEX  Units  and  cannot  be 


*.*•*«*•"»' •*«^*»***a^*«^  k"**  V*  e"'  •*’  ‘  • 


size 

Time 

(APEX  Units) 

Nodes 

Discrete/LP 

Problem 

NXi« 

LP 

Total 

To  Find 

Total 

Ratio 

1  SHARP 

10 

2.25 

92.0 

192.1 

137 

315 

1.25 

SEMI-SHARP 

4.15 

617.0 

972.7* 

699 

2113* 

1.25 

2  SHARP 

10 

2.31 

90.237 

186.9 

182 

373 

1.26 

SEMI-SHARP 

4.14 

150.1 

2305* 

135 

2668* 

1.26 

3  SHARP 

15 

3.5 

199.6 

1458* 

229 

1791* 

1.25 

SEMI-SHARP 

7.2 

615.1 

2489* 

420 

2003* 

1.25 

IIA 

directly  compared  to  CPU  seconds,  even  though  we  have  found  APEX  units  to 
be  virtually  identical  to  CPU  seconds  in  our  experiments.  Again,  even 
though  SEMI-SHARP  produces  an  identical  LP  relaxation  solution  as  the 
SHARP  formuation,  its  (SEMI-SHARP's)  lack  of  hereditary  sharpness  causes 
the  code  to  search  much  longer  to  even  find  a  feasible  integer  solution. 
Since  the  SEMI-SHARP  formulation  does  not  solve  the  problems,  exact 
comparisons  are  unavailable.  However,  the  SHARP  formulation  is  at  least 
7  times  more  efficient  and  over  10  times  faster  (problem  f/2).  The  third 
problem  is  the  largest,  NXi=15,  and  the  results  display  the  difficulty  of 
these  type  problems. 

The  results  of  our  tests  indicate  a  definite  computational 
advantage  of  our  hereditarily  SHARP  formulation  over  both  non-sharp  and 
non-hereditarily  sharp  formulations.  The  advantage  appears  to  increase 
with  problem  size,  however  the  difficulty  of  the  problems  prevents  us 
from  confirming  our  hypothesis. 
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appendix  A: 


TWO  FORMULATIONS  FOR  BOUNDED-MIP  REPRESENTABILITY 


Suppose  that  a  finite  union  S  of  polyhedra  S  ■  P^  U  •••U 
satisfies  the  condition  S)  on  recession  directions,  from  the  Theorem  of 
Section  1.  Then  by  that  Theorem,  S  is  bounded-MIP  representable.  In  fact, 
in  chapter  II  we  provided  two  mixed'integer  formulations  for  the  set  S:  the 
"polyhedral"  form,  and  the  "extreme  point"  form.  The  "polyhedral  form"  is 
given  as  follows.  If  Pj^  *  {x  (  A*'  x  >  b*'}  for  h*l  s  is  a  polyhedral 

representation  of  P^^,  then  S  is  represented  by 


X  «  Z  x‘ 
h-1 


(Al.l) 


h  h  h 

Ax  >  b  A^,  h*l  • .  •  • ,  5 


Z  A  *  1 ,  A  ^0  forh“l  ,...,  s 


A.  integer 
h 


(Al.l)  is  the  form  given  by  Balas  I2]  for  the  constraints  of  a 
disjunctive  program.  It  is  also,  after  algebraic  simplifications,  the  dual 
to  the  linear  system  of  the  co-proposition  of  a  statement 
‘  h  h 

V  (A  X  >  b  )  in  "disjunctive  normal  form"  (for  definition,  see  (13])  where 


"V"  indicates  the  "or"  connective,  or  disjunction  (for  a  proof  of  this  fact, 
see  flO]). 
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Let  us  consider  first  a  "distribution  center"  formulation.  Here, 
if  any  flow  from  the  center  (a  <  j  <  t)  is  positive,  the  center  must 
be  built,  and  so  a  "fixed  charge"  of  c  >  0  assessed.  Using  the  variable  z 
for  the  charge,  we  have  t*2  and  S  •  ^  ^2*  »•••» 

>  0  and  all  x^  *  0},  Pj  *  {(z»  Xj  »•••,  |  x  >  c  and  all  0  <  x^  <  M^}. 

Indeed,  S  *  epi(f)  where 


r  1  if  any  X .  >  0, 

•  «t>  ■< 

I  0  if  all  X .  •  0 
J 


where  all  x.  >  0, 

and  X.  < 

J  J 


(A1.2) 


We  seek  a  modelling  of  the  function  f,  assuming  a  minimizing  objective 
function  in  which  ♦f  occurs. 

To  obain  on  sharp  modelling,  we  utilize  the  polyhedral  form 
(Al.l),  and  we  obtain: 


(z.  Xj  ,...,  x^) 
1 

z  >  O'Xj 
XI  *  0*Ai, 


.11  1.  2  2  2 
(z  ,  Xj  ,...,  x^)  ♦  (z  ,  Xj  ,...,  x^) 

2  ^ 

*  >  X2  *0 

2 

0  <  xj  <  X2 ‘Ml 


(A1.2) 


x;  -  o.Aj, 


0  <  x‘  < 


Aj  *  Aj  -  1,  Xj  >  0,  >  0 


X^  and  X2  integer 


Upon  simplification  (which  is  optional  for  use  in  algorithms,  but  will  make 
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2  2 

the  modelling  more  perspicuous),  ve  hsve  ■  x>  for  all  j,  X.*c  >  z  >  z, 

J  J  • 

and  (Al.2)  gives  constraints 


0  <  X,  <  M. ‘z 
J  J 


(A1.3) 


where  '*'C*z  is  to  be  entered  in  the  (minimizing)  objective  function.  We  note 

that  (A1.3)  is  different  from  the  formulation  Xj  x^  <  M»z,  with  M  * 

that  is  often  suggested,  and  which  is  not  sharp. 

The  “extreme  point*'  formulation  requires  that  we  write  each 

polyhedron  as  the  sum  of  the  convex  span  of  points  . ^o(h),h^ 

which  may  be  dependent  on  h  (and  which  may  be  extreme  points,  for  example) 

1  0 

and  a  convex  cone  generated  by  elements  w  w  which  are  to  be 

independent  of  h.  While  the  condition,  that  the  P.  all  have  the  same 

n 

recession  directions,  is  not  satisfied  by  all  unions  S  ■  P^U  ... 
representing  S,  it  is  satisfied  by  some  union,  when  S  is  bounded-MIP' 
representable,  as  noted  in  Section  3.2. 

The  extreme  point  formulation  of  S  is 


s  o(h)  .  S 

r  E  X  .v”^  +  r  ©,wJ 


h«l  i-1 


j«l  J 


S,i  "  ® j  ^ 


.  >  0 


(Al.3) 


X  «  I  X  . 
h  ..J  hr 


E  X  «  1 
h«l  ^ 


all  X^  integer 

n 
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The  extreme  point  formulation  ha$,  generally,  fewer  constraints  but  more 
variables  than  the  polyhedral  formulation.  From  a  practical  perspective, 
the  extreme  point  formulation  cannot  be  used  when  any  one  of  the  polyhedra 
is  a  "large  dimensional  cube,"  or  any  other  polyhedron  with  a  huge  number  of 
extreme  points.  For  example,  we  would  not  use  it  in  the  distribution  center 
problem,  with  t  very  much  more  than  t«3  or  4,  since  is  a  hypercube. 

In  function  formulations  where  S  *  epi(f)  and  f  is  defined  only  on 
a  bounded  domain,  the  sole  recession  direction  is  the  (upward)  vertical  one. 
Since  the  function  is  to  be  entered  into  a  minimizing  objective  function,  at 
a  minimum  the  recession  direction  is  not  used,  so  in  our  formulation  work  we 
often  ignore  it. 

We  conclude  by  indicating  the  derivation  of  the  extreme  point 
formulation  of  the  function  of  Figure  1.  We  have  S  *  epi(f) 

■  U  P2 ^3^  ^4*  where  the  polyhedra  and  their  extreme  points  are  as 
follows : 


Polyhedron 


Extreme  Points 


Pj  ■  {(z,x)  I  e  >  0,  *  ■  0) 


P2  ■  1  0  <  X  < 


*  >  foi  ♦  (sii)x) 


(0,0) 

^0 

'a’ 

^2 

"oi  *  'a  ♦  "a><*a'-‘''a'- 

‘‘oi  -  *a  *  "a’'*a>‘ 

'•2i>"2i-'a>'  '21' 

^22 
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APPENDIX  B 


ILLUSTRATION  OF  THE  CALCULATION  OF  UPPER  BOUNDS  IN  THE 
STANDARD  FORMULATION 


Sample  ■  3  divisions 

2  technologies/division 

3  const ./division/tech. 
3  common  constraints 

3  products/division 


max: 

xl  ♦  9x2  +  8x3  +  9x4  ♦  10x5  ♦  10x6  ♦  6x7  ♦  4*8  ♦  7x9 
S.t.: 

(common)  (1)  8x1  ♦  9x2  ♦  4x3  ♦  2x4  ♦  3*5  ♦  8*6  ♦1x7  +  8x8  +  8x9  <  103* 

(2)  5x1  ♦  3x3  ♦  7x4  ♦  6x5  ♦  6x7  ♦  8x8  +  x9  <  73* 

(3)  1x1  ♦  3x2  ♦  3x3  ♦  3x4  ♦  5x5  +  7x6  ♦  7*7  +  9x8  +  7x9  <  91* 


Division  1 


(4) 

8x1  +  5k2  ♦  7x3  <  41 

(7) 

7x1  ♦  1x2  ♦  8x3  <  33** 

(5) 

4x1  3x2  +  7x3  <  29 

or 

(8) 

8x1  ♦  2x2  ♦  8x3  <  37 

(6) 

2x1  ♦  4x2  ♦  9x3  <  31 

(9) 

7x2  ♦  x3  <  17 

(10) 

8x5  ♦  7x6  <  27 

(14)  5x4  ♦  8x5  ♦  4x6  <  35 

(11) 

4x5  ♦  2x5  ♦  6x6  <  19 

or 

(15)  3x4  ♦  2x5  ♦  6x6  <  23 

(12) 

1x4  6x5  ♦  2x6  <  35 

(16)  8x4  3x5  ♦  3x6  <  29 

(18) 

4x7 

8x8  ♦ 

7x9 

< 

39 

(21) 

3x7  ♦  8x8 

23 

(19) 

6x7 

♦  8x8  ♦ 

3x9 

< 

35 

or 

(22) 

8x7  ♦  3x8  ♦ 

8«9 

39 

(20) 

5x7 

♦  8x8  ♦ 

6x9 

< 

29 

(23) 

5x7  3x8  ♦ 

3x9 

< 

23 

*  r.h.s.  ■  oi*(sum  coeff)  ♦  1,  here  o  ■  2. 

**  r.h.s.  ■  2  (sum  of  coeff)  ♦  1. 


r.  e  .  -■  '  .  ■  .  •  -  -  •  / 
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For  our  example,  vt  can  put  these  bounds  in  Division  1: 


x(l)  ■  min  { 


x(2)  ■  min  { 


41  I  I  29  I  j  31  I 

-T  *  1  "5-  I* 

1“}  H' 


)  ■  5  R(l)  •  max  {5,  4)4-  1  *6 


x(l)  ■  min  { 


x(2)  *  min  { 


W’  L-J-W"’ 


LtJ-  L"d'  [‘"’J 


R(2)  »  max  {7,2}  +1*8 


>  •  i 


x(l)  ■  min  { 


xf2)  »  min  { 


L-1 W-W” 

1^3/^.  |37/8j,  j^7/lj)  • 


)  »  3  R(3)  *  max  {3,4}  ♦  1  ■  5 


The  Upper  Bounds  are: 

UB(4):  8(6)  +  5(8)  +  7(5)  ■  123  UB(6):2(6)  ♦  4(8)  ♦  9(5)  -  89 

UB(5);  4(6)  ♦  3(8)  *  7(5)  -  83  UB(7):7(6)  ♦  1(8)  ♦  8(5)  ■  90,  etc. 
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In  the  Standard  Form,  the  Division  Constraints  become: 


(1) 

Division  1 

** 

8x1  ♦  5x2  ♦  7*3 

♦ 

82zl 

< 

123 

(4) 

7*1 

♦ 

1x2  ♦ 

8x3  ♦ 

57z2 

< 

90 

(2) 

4*1  ♦  3*2  ♦  7*3 

♦ 

54zl 

< 

83 

(5) 

8*1 

♦ 

2*2  + 

8*3  + 

67z2 

< 

104 

(3) 

2x1  ♦  4*2  ♦  9*3 

♦ 

58zl 

< 

89 

(6) 

♦ 

7x2  ♦ 

1x3  + 

44z2 

61 

(7)  rl  ♦  r2  •  1 


where  **  coefficients  of  discrete  variables  are  found  as  below: 

82  «  (Upper  Bound  -  R.H.S.)  -  (123  -  41)  »  82 

(  83  -  29)  •  54 
(  89  -  31)  -  58,  etc. 
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CHAPTER  IV 

ONE  TEST  OF  THE  PROXIMITY  OF  THE  LINEAR  RELAXATION 
AS  A  GUIDE  TO  PROBLEM  DIFFICULTY 


Introduction 

The  purpose  of  this  chapter  is  to  add  further  confirmation  to  a 
commonly  accepted  viewpoint,  that  the  proximity  of  the  linear  relaxation 
is  a  gauge  of  problem  difficulty. 

From  our  earlier  results  we  see  that  the  Multi-Divisional 
problems,  whose  LP  relaxation  optimum  is  within  1  percent  of  the  integer 
optimum,  are  much  easier  to  solve  than  the  fixed-charge  problems  whose 
LP  relaxation  optimum  is  not  within  25  percent  of  the  integer  optimum. 

In  this  chapter  we  present  two  classes  of  problems,  of  similar 
construction,  whose  LP  relaxations  are  very  dissimilar  in  terms  of  their 
proximity  to  the  convex  span  of  the  integer  solutions.  We  then  compare 
solution  difficulty  of  the  two  problems  using  Martin's  BANDBX  system. 

The  first  class  of  problems  is  a  slight  variation  of  the  well- 
known  fixed-charge  problem.  The  second  class  of  problems  is  of  the 
fixed-benefit  variety,  in  which  a  benefit  is  received  if  a  product  is 
utilized. 

Both  problem  sets  involve  "minimum  usage  levels"  which  activate 
the  fixed-charge  or  benefit,  as  well  as  upper  bounds  on  the  value  of  the 
variable.  These  two  parameters  determine  how  close  the  LP  relaxation  is 
to  the  MIP  formulation.  In  the  ’usual'  setting  of  these  parameters,  the 
usage  level  is  much  smaller  than  the  upperbound.  For  usual  scenarios. 


4.’  • 
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the  fixed-charge  LP  relaxation  feasible  set  is  much  larger  than  the 
feasible  set  of  the  MIP  formulation,  even  when  the  best  possible  (i.e. 
sharp)  formulation  is  used.  On  the  other  hand,  the  LP  relaxation 
feasible  set  for  the  fixed— be:iefit  problem  is  usually  nearly  identical 
to  its  integer  feasible  set.  However,  by  altering  the  minimum  usage 
level  so  that  it  is  much  larger  than  usual,  the  fixed-benefit  LP 
relaxation  feasible  set  becomes  noticeably  larger  than  its  MIP  feasible 
set.  Thus  we  are  able  to  test  how  problem  difficulty  varies  as  the 
differences  between  the  LP  relaxation  feasible  sets  and  MIP  feasible  sets 
are  altered.  Since  we  proved  earlier  that  the  LP  relaxation  feasible  set 
of  any  SHARP  formulation  is  exactly  the  closure  of  the  convex  hull  of  the 
MIP  representable  set  S,  we  are,  therefore,  testing  the  effect  on  problem 
difficulty  as  the  size  of  the  convex  hull  of  a  MIP  representable  set 

changes . 

We  remark  that  we  use  both  minimum  usage  levels  and  upper  bounds  in 
our  problems,  since  the  fixed  charge  problems  require  bounds,  while  the 
fixed-benefit  problems  require  usage  levels,  in  order  to  be  formulated  as 
MIPS.  By  giving  both  problem  sets  both  usage  levels  and  bounds,  they  are 

more  comparable. 

Problems 

The  general  problem  description  for  both  the  fixed-charge  problem 
(FC)  and  the  fixed-benefit  problem  (FB)  have  the  mathematical  form. 
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I  (c .X  .  ♦  f .(x  .  )  } 
j=l  J  J  J  J  ^ 


(4.1) 


s.t. :  Ax  >  b 


X  >  0 


where  fj(Xj)  is  a  nonlinear  function  defined  as  follows.  For  the  fixed" 


charge  case, 


(FC)  f 


To  if  X.  ■  0 
.(x.)  J 

^  ^  IS.  if6.  <x.<M. 

J  J  J  J 


for  all  j 


(4.2) 


and  for  the  fixed-benefit  problem. 


(FB)  f. 


f  0  if  X.  »  0 
.(x.)  -<  J 

^  ^  1  -s .  if  6.  <  X .  <  M. 

^  J  J  J  J 


for  all  j. 


(4.3) 


In  both  (4.2)  and  (4.3),  is  nonnegative,  6^  is  strictly  greater  than 
zero,  and  Mj  is  greater  than  6^. 

A  typical  example  is  the  product  mix  problem  in  which  the  objective 
IS  to  minimize  the  cost  of  resources  while  satisfying  certain  quality- 
control,  or  demand  constraints.  The  cost  of  the  resources  involves  a 
linear  portion  (c.x.)  as  well  as  the  nonlinear  f  (x  )  term.  In  the  FC 

•1  J  J  j 

case,  an  additional  charge  of  Sj  is  incurred  if  the  amount  of  resource 
j(Xj)  falls  within  the  16^,  interval.  In  this  case,  6^  is  the 
minimum  usage  level  and  Mj  is  the  maximum  usage  level.  In  the  FB  case,  a 
benefit,  or  negative  cost  is  received  if  resource  j  falls  within 
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the  [6.,  M.)  interval.  Note  that  a  value  of  x.  strictly  between  0  and  { 
J  J  J  j 

is  not  permitted. 

n 

Defining  the  set  as  epi{f^(x^)),  and  S  ■  U  S^,  we  see  that  S 
is  KIP  representable.  The  graph  of  Sj  for  both  problems  follows. 


tj(x) 


Figure  4.1 

Fixed-Charge  vs.  Fixed-Benefit 


Using  the  "extreme  point"  sharp  modelling  for  the  FC  problem  we  add 
to  the  constraints  in  (4.1), 


X.  »  0,.  6.  ♦  G-.M. 

J  JJ  J  2j  j 

(FC)  X.  •  e, .  ♦  ©,. 

J  Ij  2j 

A,  c  {0,  1),  G,  G...  >  0 


(4.S) 


■nd  rcpUct  in  the  objective  function  with  the  term  X.ee  .  The 

}  i 


•odel.io,  i„  the  FB  ce.e  i.  identic.)  except  (-X,*.,)  i.  i„  che 

objective  function.  Thu.  we  heve  two  very  .ieil.r  .^od.nin,.  who..  XP 
r.l.xetion.  ere  co.p,.„e„t.ry  in  th.t  the  clo.er  the  proxi.it,  oi  the  FB 
IF  r.!...tion  to  the  ori,i„.l  MIP  eet.  the  wore,  the  proxi.ity  of  the  PC 
bF  r.lex.tion  i.  to  it.  ortin..  m,p 
nocion. 


X. 


3 


X. 

3 


Figure  4.2 

L.P.  Relaxations  (Convex  Hulls) 


Notice  the  shaded  area  representing  the  LP  relaxations  of  both  problems. 
First  notice  that  the  difference  between  the  original  representable  set 
Sj  (4.4)  and  the  LP  relaxation  (4.6)  is  much  greater  for  the  FC  problem 
than  the  FB  problem.  Thus,  if  our  conjecture  is  correct,  the  typical 
fixed  charge  problem  is  more  difficult  than  the  typical  fixed  benefit 
problem.  Suppose  we  depart  from  the  typical  case  in  which  Sj  is  small 
with  respect  to  Mj,  and  increase  the  value  of  the  minimum  usage  level  to 
one-half  of  the  maximum  usage  level.  The  LP  relaxations  now  become. 


Figure  4.3 

Effect  of  Minimum  Usage  Level 
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Notice,  the  FC  LP  relaxation  is  unchanged  while  the  FB  LP  relaxation  does 
change.  More  importantly  the  set  -  (clconv(Sj)  -  S^),  while  larger  in 
both  cases  as  6^  increases,  its  sire  increase  is  larger  in  the  FB 
problem.  Therefore  we  expect  that  as  increases,  the  computational 

advantage  of  the  FB  problem  may  decrease.  Our  tests  confirm  our 
expectations. 

Experiments 

We  run  several  samples  of  the  problems  described  by  (4.1),  (4.2), 
and  (4.3)  and  modelled  using  the  “extreme  point”  formulation  (4.5).  We 
solve  two  different  sizes  of  problems,  the  first  (1)  contains  10  original 
variables  and  20  original  constraints  (i.e.  A  matrix  Is  20  x  10),  the 
second  (II)  contains  20  variables  and  40  constraints  (A  matrix  is  40  * 
20).  Using  the  extreme  point  formulation,  the  problems  of  size  I,  have 
50  constraints  and  80  variables,  20  of  which  are  binary;  size  II  problems 
have  100  constraints,  160  variables,  of  which  40  are  binary  variables. 

For  each  problem  we  randomly  choose  each  a^j  as  an  integer  between 

n 

1  and  10.  Each  b.,  (i»l,n)  is  selected  as  b.  *  la..,  thus  x.  =  1  for 

"  "  j-1  J 

all  j,  is  a  feasible  solution.  Next,  we  randomly  choose  each  c.  as  an 

integer  between  5  and  10.  For  some  problems,  the  fixed-charge  (benefit) 

s.  is  30%  of  the  total  cost  of  setting  a  variable  to  one.  i.e.  s.  * 

J  J 

0.3(s^+c^).  For  other  problems  the  percentage  of  the  fixed-charge 
(benefit)  is  increased  to  50%  of  the  total.  The  maximum  usage  level  (Mj) 
is  4.0,  while  the  minimum  usage  level  (6-)  varies  between  6,  »  0.1,  5.  * 
1.0,  and  6j  ■  2.0.  Thus,  the  minimum  usage  level  varies  from  2.5%  to  50% 
of  the  maximum  usage  level. 
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After  the  problem  parameters  are  thus  randomly  chosen,  we  create  a 
pair  of  problems.  In  the  first  member  of  each  pair,  the  parameters  are 
used  to  construct  a  fixed-charge  problem  as  described  above.  The  second 
member  of  each  pair  is  made  into  a  fixed-benefit  problem  with  exactly  the 
same  data. 

Along  with  the  usual  statistics,  we  compute  the  ratio  of  the  fixed- 
Benef it/Fixed-Charge  solution  times.  The  results  are  summarized  in 
Tables  14  through  16.  Table  14  summarizes  problems  of  size  I  and  II, 
with  a  minimum  usage  level  of  2.0,  and  a  fixed  cost  prcentage  of  301!. 
Table  15  represents  problems  of  size  I  and  II  with  minimum  usage  levels 

of  1.0  and  fixed  cost  percentage  of  30%.  Table  16  summarizes  problems  of 
both  sizes  with  minimum  usge  levels  of  0.1  and  a  50%  fixed  cost 
percentage. 

As  we  predict,  the  FB  problem  solves  faster  and  more  efficiently 
than  the  FC  problems.  But  the  results  are  not  as  dominant  as  one  might 
expect.  First,  notice  that  the  (LP/Best  Discrete  Solution)  ratio  is  over 
0.9  for  both  FB  and  FC,  thus  the  problems  are  "easy"  fixed-cost  problems 
to  begin  with.  Problems  with  lower  ratios  as  in  Chapter  III  may  produce 
more  dramatic  results.  From  Tables  13  and  14  we  find  that  when  the 
minimum  usage  level  is  significant  with  respect  to  the  upper  bound,  the 
FB  problems  are  not  much  faster,  but  are  more  efficient.  In  fact,  in 
some  instances,  the  FC  problem  solves  faster  even  though  it  requires  mere 
nodes  to  solve  the  problem  than  its  FB  counterpart.  As  our  Fig* 

^.3  suggests,  problems  with  small  minimum  usage  levels  are  easily 
solved  as  FB  problems  (see  Table  15). 


s' 

s' 
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An  interesting  discovery  among  all  samples  is  that  the  FB 

representation  is  harder  to  solve  as  a  linear  program  than  the  fixed 

charge  problem.  We  believe  this  is  partially  due  to  the  fact  that  the  Cj 

values  are  at  least  as  great  as  the  z.  coefficient  (s.)  values. 

J  3 

Therefore  in  the  FC  relaxation  problem,  once  the  Ax  >  b  constraints  are 

satisfied  then  the  X. variables  are  set  to  X.  *  x./4.  But  for  the  FB 

J  3  3 

problem,  the  code  will  initially  try  to  set  all  X^  variables  to  their 
upper  limit  of  one,  to  obtain  the  benefits,  which  forces  the  x^  values  to 
be  at  least  the  minimum  usage  level.  But,  since  increasing  the 
XjVariables  adds  the  relatively  large  c^  costs,  the  code  then  re-adjusts 
the  Xj  values,  which  tends  to  increase  the  number  of  iterations  in  the  LP 
problem. 


Table  14.  Fixed-Charge  vs.  Fixed-Benefit 
•  2.0,  Z  Fixed-Cost  -  30% 

using  Martin's  BANDBX  Code 


Solution  Time  (CPU) 


Problem  FC  FB 

Size  LP  Total  LP  Total 
(#) 


NODES  SOLVED 


LP/Discrete 

Solution 

Ratio 


FC 

FB 

FC 

FB 

To 

To 

Find  Total 

Find  Total 

31  50 

17  30 

.952 

.979 

152  206^ 

0^ 

00 

00 

0^ 

.970^ 

.988^ 

^neither  problem  was  solved  within  600  CPU  seconds 

**only  one  of  the  two  problems  solved  within  600  CPU  seconds 

^Total  Nodes  unknown  as  problems  were  not  completely  solved. 

2 

Best  Discrete  Solution  found  at  time  of  shutoff  is  used  in  ratio. 
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Table  15.  Fixed-Charge  vs.  Fixed-Benefit 
di  •  1.0,  X  Fixed-Cost  ■  302 

Using  Martin's  BANDBX 


Solution  Time  (CPU) 

NODES  SOLVED 

LP/Discrete 

Solution 

Ratio 

Problem 

Size 

(#) 

FC 

LP  Total 

FB 

LP  Total 

FB/FC 

Ratio 

Time 

FC 

To 

Find  Total 

FB 

To 

Find  Total 

FC 

FB 

I  (3) 

1.43  23.1 

3.37  9.42 

0.41 

32  37 

8  9 

0,961 

0.995 

II  (2) 

8.35  337.1 

23.8  208.3 

0.62 

67  117 

43  52 

0.976 

0.997 

-•.v; 


Table  16.  Fixed-Charge  vs.  Fixed-Benefit 
6-  »  0.1,  2  Fixed-Cost  ■  502 

Using  Martin's  BANDBX 


Solution  Time  (CPU) 

NODES  SOLVI 

iD 

LP/Dis 

Soluti 

Ratio 

screte 

Lon 

Problem 

Size 

(#) 

FC 

LP  Total 

FB 

LP  Total 

FB/FC 

Ratio 

Time 

FC 

To 

Find  Total 

FB 

To 

Find  Total 

FC 

FB 

1.23  51.2 

2.81  2.93 

0.06 

89  95 

1  1 

0.933 

1.0 

II  (2) 

6.92  432.4 

24.6  24.8 

0.06 

81  167 

1  1 

0.969 

1.0 

II  (2) 

6.92  432.4 

24.6  24.8 

0.06 

81  167 

1  1 

0.969 

1.0 
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CHAPTER  V 

TWO  TESTS  OF  PROPOSITIONAL  LOGIC  PROBLEMS 

Introduction 

The  purpose  of  this  chapter  is  to  explore  the  effectiveness  of  our 
Sharp  modellings,  and  "standard"  modellings,  in  problems  of  propositional 
logic  (see,  e.g.  (2]  or  [33]  for  a  discussion  of  this  logic). 

The  first  test  is  entitled  "Satisfiability  Testing."  In  this 
experiment,  we  are  given  a  formula  in  conjunctive  normal  form.  (i.e.  as 
a  conjunction  of  a  disjunction  of  literals  ~  a  literal  being  a 
propositional  letter  or  its  negation)  We  wish  to  determine  whether  the 
formula  is  satisfiable. 

The  given  information  for  the  second  experiment  is  a  set  of 
implications,  L^,  for  i*l,...,n  where  each  L^  is  a  disjunction  of 

literals,  and  is  a  conjunction  of  literals  in  some  experiments,  and  a 
disjunction  of  literals  in  others.  We  wish  to  determine  whether  the  set 
of  implications  imply  that  a  particular  literal  is  true,  provided  certain 
other  literals  are  all  given  as  true.  The  literals  last  mentioned  are 
part  of  the  data  of  the  problem. 

Our  interest  in  satisfiability  testing  lies  in  its  role  as  an 
important  problem  type  for  theoretical  analysis,  particularly  the  NP- 
completeness  theory.  Our  interest  in  implication  testing  lies  in  the  use 
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of  implicotions  in  the  "expert  syetems"  of  artificial  intelligence  (14, 
15]. 

In  forcing  certain  letters  Aj  to  be  true  and  a  letter  B 

to  be  false,  with  all  implications  ^^“^****»*'^  true,  and 

determining  if  the  overall  result  is  consistent,  we  are  in  fact 
ascertaining  if  B  is  forced  to  be  true  when  the  "data"  on  Aj  are 

as  forced  and  the  "production  rules"  *  I'x  ^i*l»***»*'^  valid.  In 
actual  expert  systems,  more  complex  relationships  are  modelled  than  those 
which  can  be  represented  by  propositional  logic,  but  it  is  the  expert 

system  paradigm  which  motivates  us. 

In  the  problem  studied  here,  the  MIP  code  is  used  only  as  a 

consistency  tester  (i.e.,  there  is  no  criterion  function). 

Both  propositional  logic  experiments  are  MIP  representable,  in 
fact,  both  can  be  modelled  as  pure  integer  programs.  However,  it  is 
unnecessary  to  actually  declare  all  variables  as  integer,  thus  the 
problems  we  solve  are  mixed-integer  programs.  One  of  our  initial  goals 
was  to  develop  an  efficient  branch-and-bound  algorithm  to  solve  these 
problems,  but  we  are  surprised  to  find  that  standard  branch-and-bound 
code  (Martin's  BANDBX  and  APEX  IV)  solve  the  problems  very  effectively. 

In  fact,  as  shown  later,  we  are  forced  to  alter  our  random  generator  to 
artificially  create  difficult  problems  in  order  to  compare  the 
computational  effort  of  different  modellings  used. 

Both  modellings  solve  the  problems  so  efficiently  that  we  stop 
comparing  modellings  and  concentrate  solely  on  creating  difficult  problem 
instances.  I.e.,  for  satisfiability  problems  of  the  sire  we 


■-w.j 


study,  there  is  no  advantage  to  the  newer  methods,  since  random 
propositional  formula  are  easy  to  decide  by  branch-and-bound . 
Problems 
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The  propositional  logic  formulas  for  the  satisfiability 
experiments  are  developed  in  the  following  manner.  Given  the  total 
number  of  literals  to  choose  from,  t,  the  number  of  literals  per  clause, 
s  (s  <  Jf),  and  the  total  number  of  clauses  in  the  formula,  c,  we  randomly 
generate  c  clauses  of  size  s  in  the  i  literals  and  test  whether  the 
resulting  formula  is  satisfiable.  For  instance,  if  1*5,  s*2,  and  c=3, 
the  following  formula  may  be  randomly  generated. 


K  -  (A,  V  -  Aj  A  (A,V  A  )  A  (A  V  A)  A  (A,  V  A  ),  (5.1) 

2  5  1  5  1  2  4  5 


where  "V"  indicates  "or,"  "A"  indicates  "and",  also  implies  negation 
of  the  literal  A^.  To  model  the  formula  as  a  mixed-integer  program,  we 
assign  binary  variables  to  each  literal  of  the  formula  such  that  if  A^^  is 
true  then  its  associated  variable  ■  1,  similarily  if  A^  is  false  then 
x^  ■  0.  Negation  of  literals,  are  represented  as  1-x^.  To  model 

(5.1)  we  first  model  the  individual  clauses,  i.e.  (A^  V  A2  .••  V  A^). 
Modelling  the  V-connective 
For  an  expression  of  the  form 


A  -  AjV  A^  V  ...  Aj., 


(5.2) 


where  each  Aj  has  been  assigned  a  variable  and  a  variable  x  is 
assigned  to  A,  a  non-sharp  modelling  follows: 
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X  >  (Xj  ♦  x^  x^)/t  (5.3) 

X  <  Xj  +  Xj  +. . .♦  x^. 

Our  Sharp  modelling  of  the  expression  is; 

X  >  x .  for  ,t  (5.4) 

J 

X  <  X^+  x^  ♦. . .+ 

The  non-sharp  model  (5.3)  is  similar  to  some  representations  which 
have  been  used  (largely  unsucessfully)  in  practice. 

The  model  (5.4)  which  we  are  calling  "Sharp"  here  is  similar  to 
one  used  successfully  in  practice  in  a  setting  where  the  logical 
constraints  were  part  of  a  much  larger  group  of  constraints  [Johnson, 
et  al.,  [31],  and  has  the  same  character  as  the  "disaggregated" 
constraints  for  a  distribution  center  with  a  fixed-charge  [Graves  & 
Geoff.,  19].  In  both  settings  it  is  crucial  to  successful 
implementation. 

We  call  (5.4)  "sharp"  by  extension  of  terminology.  The  set  S  to 

be  modelled  has  the  form  S  •  Pj  [J  S',  where  Pj  ■  {x,  Xj,..,x^)l 

X  *  Xj  ■  ...  *  x^  ■  0)  is  a  polyhedron  and  S'  •  {(x,  x^ . x^  |  x  *  1, 

Zxj  >  1,  all  X.  e{0,l})  is  a  bounded-MIP  representable  set  (in  fact,  the 
t-1 

union  of  2  polyhedra).  We  relax  S  to  the  polyhedron 

P^  •  {(x,  X . .  )  1  X  *  1,  Ex.  >  1,  0  4  X,  41  all  j)  (note  that 

2  I  t'  J  J 

that  ^2  “  conv  (S'))  and  then  we  model  Pj  U  P2  by  the  polyhedral  method 


as  follows: 
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(1)  (2) 

X  -  X  ♦  X 

(1)  (2) 


(1) 

X  ■  0»x, 


(2) 

X  ■  1  • 


(5.5) 


0»Xj  all  j  IXj  >  1  •X2 


0<Xj  <1»X2  allj 


•  ,r-  u  «  ^2)  (2) 

upon  simplification,  we  have  x  *  0  +  X2  *  X2,  Xj  ■  0  +  x-  “ 

(2) 

we  note  that  “  *j  <  X  2  *  x.  This  gives  (5.4)  which  is  sharp  for 
the  given  relaxation  U  P2  S  *  P ^  U  S ". 

All  literal  variables  (x^)  are  declared  binary,  while  the  formula 

variable  (x)  is  not  declared  integer  even  though  it  is  forced  to  integer 
values.  Notice  our  Sharp  modelling  involves  (t+1)  constraints  while  the 
non-sharp  model  requires  only  2  constraints  per  expression. 

Modelling  the  "A”  connective.  For  an  expression  of  the  form: 


A  *  Aj  A  A2  A  . . .  A  A^ 


(5.6) 


where  each  literal  has  already  been  assigned  a  binary  variable  x.,  and  A 
is  assigned  a  variable  x,  the  model  is  developed  in  the  following  manner. 
Since  (5.6)  is  equivalent  to 
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with  all  variables  non-negative.  Assigning  as  the  multiplier  for  the 
left  side  of  (3.10)  and  as  the  right  side  multiplier,  our  polyhedral 
form  for  MIP  representations  simplifies  to: 


X  ■  X 


X  ^  <  X  j=l , . . .t 


I  X.  >  X 

j-1  ^ 

X  e  {O.U. 

which  simplifies  to  (5.4). 

Combining  the  modellings  of  the  "V"  connective  and  the  "A" 
connective  allows  us  to  model  conjunctive  normal  form  formulas  and,  in 
fact,  any  propositional  expression  in  'V',  "A,”  and  We  next 

illustrate  this  with  the  formula  (5.1) 

The  non-sharp  representation  is, 


♦  (1-Xj)  >  x^ 

X2  *  (1-Xj)  c  2x^ 


(5.11) 


X,  ♦  Xj  >  x^ 

*1  *  *2  ‘  ^’‘7 


•  “S  ’  'b 
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X. 

<  2x„ 

4 

5 

6 

X, 

*  x_ 

+  x„  >  3x„ 

6 

7 

8  9 

~x„ 

♦  X- 

+  X,  +x„  < 

2 

9 

6 

7  8 

Xi, 

^2  * 

X,  ,  X,  are 
4’  5 

binary 

all  variables  >  0. 

In  the  above,  x, ,  x_  x„  are  variables  assigned  to  each  of  the  clauses  of 

the  formula,  and  x^  is  assigned  to  K.  Therefore,  once  the  constraint 

x^  1  is  added  to  the  constraint  set  (5.11),  the  formula  (5.1)  is 

satisfiable  if  and  only  if  the  constraint  set  is  consistent. 

Modelling  the  Implication.  The  implication  testing  experiment 

requires  all  implications  (K^  L^)  to  be  true,  with  (K^  ♦  L^)  equivalent 

to  (-  K.  V  L.).  Therefore  each  implication  is  modelled  as  an  "V" 

11 

connective.  We  next  work  an  example. 

The  implication 

(~  A  A  B  A  ~  C)  ♦  -  D  (5.12) 

is  transformed  to  the  disjunction 

AV^BVCV'-D  (5.13) 

(since  (-AABA-OisfAV-BVC)  ).  Then  (5.13)  is  modelled 

as 


143 


x,+(l-X-)+x-+(l-x, )  >  1  (5.14) 

1  i  3  4 

and  (x,+(l-x,)+x-+(l-x  )]/4  <  1 
1  z  3  4 

Since  the  implication  is  required  to  be  true,  in  (5.14)  we  have 
substituted  '1'  for  the  variable  x  which  would  (in  a  setting  of  nested 

logical  connectives)  have  been  assigned  to  the  entire  subformula  in 

(5.13) . 

The  second  constraint  in  (5.14)  is  equivalent  to  x^  -  +  x^  -  x^ 

<  2,  which  is  redundant  for  0  <  x^  <  1,  so  it  can  be  dropped.  If  (5.9) 
is  used  instead  of  (5.3)  to  model  (5.13),  the  second  constraint  in 

(5.14)  becomes  four  constraints,  specifically:  1  >  x^,  1  >  1  -  x^,  1  > 
Xj,  1  >  1  -  x^.  Each  of  these  is  redundant,  and  can  be  dropped. 
Therefore,  in  this  example,  due  to  simplifications  both  (5.3)  and  (5.4) 
yield  the  same  modelling,  i.e.  the  first  constraint  in  (5.14),  called  a 
"generalized  sst  covering  constraint,"  Many  of  our  runs  are  done  via 
such  constraints,  with  no  difference  betweeft  (5.3)  and  (5.4).  By  using 
this  modelling,  with  all  propositional  letters  declared  binary,  we  have 
one  constraint  per  implication.  In  effect,  we  transform  our  logic 
problem  into  a  generalized  set  covering  problem. 

In  some  other  experiments  reported  below,  there  will  be  a 
difference  between  (5.3)  and  (5.4)  formulations. 

For  example,  to  model  the  implication 

(A  V  B  V  -  C)  ♦  D  (5,15) 


we  reduce  it  to 
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(-‘AA-BAOVD 

(as  ~  (a  V  B  V  ~  C)  is  A  A  B  AC),  which  has  a  nested  structure.  We 

introduce  a  binary  variable  x  for  the  subformula  ~  A  A  -  B  A  C,  and  the 

modelling  via  (5.3),  (5.7)  becomes 

3x  <  (1-Xj)  ♦  (l-x^)  ♦  x^  (5.16) 

-X  ♦  (l-x.)  +  (1“X^)  ♦  X  <2 

1  2  3 

1  >  (%*x^)/2 
1  <  X  ♦  X 

4 

The  modelling  via  (5.4),  (5.9)  becomes 

X  <  1  -  Xj  (5.17) 

X  <  1  -  x^ 

X  (  X^ 

1  >  X 

1  <  X  ♦  X, 

4 

In  (5.16),  the  third  constraint  is  redundant  and  in  (5.17)  the  third  and 
fourth  constraints  are  redundant.  However,  even  with  simplifications,  we 
obtain  different  representations. 

In  general,  if  what  preceeds  the  implication  (i.e.  in  ♦  L^) 
is  a  conjunction  of  literals,  simpl i f ict ions  result  in  the  same 
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generalised  set  covering  problem  for  both  sharp  and  nonsharp 
formulations.  If  disjunctions  occur  there,  even  after  simplifications, 
the  formulations  are  different. 

In  our  experiments,  we  try  runs  that  included  the  simplifications, 
some  of  which  are  the  same  (i.e.  generalized  set  covering)  for  both  the 
sharp  and  nonsharp  representations,  and  some  of  which  are  different.  In 
addition,  in  some  runs  we  do  not  make  the  simplifications  which  are 
possible  and  deliberately  left  redundant  constraints  in  the  program, 
simply  to  see  what  happens  when  these  methods  are  used  "blind." 

Furthermore,  different  codes  are  used  on  different  runs.  The 
initial  runs  on  Martin's  code  were  so  fast  that  we  decided  to  "size  up" 
Our  problems,  and  went  to  APEX  IV  when  it  arrived  on  the  Tech  campus  in 
September  1983.  As  we  shall  see,  the  runs  continue  to  be  fast, 
regardless  of  the  formulation. 

As  we  discuss  the  experiments  in  what  follows,  we  will  specify 
what  formulations  are  used,  whether  or  not  simplications  are  made  and 
what  code  is  used. 

We  remark  that,  in  the  "expert  systems"  when  implications  like 
Ki  ♦  occur,  typically,  is  a  conjunction  of  literals  and  is,  in 
fact,  a  single  literal,  and  it  is  rare  for  more  than  six  or  seven 
literals  to  occur  in  one  implication.  Such  simple  implications  are  of 
course  of  the  generalized  set  covering  variety.  However,  more  complex 
implications  can  also  occur. 

Problem  Generators.  Each  clause  within  both  the  Satisfiability 
and  Implication  Testing  experiments  is  generated  by  randomly  selecting 


the  s  literals  to  appear  in  the  clause.  The  literals  are  selected  in  the 
following  manner.  First,  a  random  number,  r,  in  (0,1.0)  is  selected, 
then,  for  the  first  literal  of  the  clause,  the  interval  (0,  1.0)  is 
divided  into  A  equally-sired  intervals.  The  interval  in  which  r  belongs 
determines  the  first  letter  of  the  clause.  To  determine  the  next  letter 
a  new  r  is  generated  and  the  interval  is  now  divided  into  (A-i) 
intervals,  insuring  that  any  one  letter  is  never  selected  twice  in  the 
same  clause.  Once  the  clause  consists  of  s  literals,  we  determine 
whether  each  will  appear  as  a  letter,  or  the  negation  (-vA^)  of  the 
letter,  with  equal  probability. 

For  the  Satisfiability  tests,  the  first  occurrence  of  any  literal 
is  randomly  determined  to  be  either  a  letter  or  its  negation,  each  having 
equal  probability  of  occurring.  After  the  first  occurrence,  each 
subsequent  occurrence  is  the  opposite  of  the  preceding  logical  type 
(i.e.,  if  the  first  occurrence  of  A1  is  as  A1 ,  the  next  is  ~A1,  the  next 
Al).  The  selection  technique  is  used  to  artificially  make  the  problems 
difficult. 

Once  "c"  independent  clauses  of  the  Satisfiability  test  are 
identified,  each  literal  is  scanned  to  determine  the  number  of  times  it 
occurs.  Any  literal  that  appears  only  once,  in  one  clause,  is  eliminated 
along  with  the  entire  clause.  The  scanning  routine  continues  eliminating 
literals  and  clauses  until  each  literal  of  the  formula  appears,  as  either 
a  true  literal  or  its  negation,  in  at  least  two  separate  clauses.  This 
last  "pruning”  step  is  to  make  the  problems  harder  for  humans  to  solve. 
After  all,  a  letter  which  appears  in  only  one  clause  can  easily  be  given 
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a  truth  value  to  make  that  clause  true,  thus  simplifying  the  problem.  We 
note,  however,  that  we  had  earlier  done  some  trial  runs  without  pruning, 
and  this  option  (used  below)  does  not  seem  to  make  any  difference  to 
machine  solution. 

The  Implication  Testing  experiment  utilizes  a  different  problem 
generator.  The  individual  clauses  (K^,  and  L^)  are  generated  randomly  as 
in  the  Satisfiability  experiment.  However,  whether  a  literal  occurs  as  a 
true  literal  or  its  negation  is  randomly  determined  for  each  occurrence. 
Each  implication  consists  of  "A”  clauses  on  the  left  (K^)  and  "V"  clauses 
on  the  right  (L^).  (e.g.  (A1  A  A2)  (Al  V  A3). 

Given  the  total  number  of  literals  present  in  the  generated 
Implications  problem,  we  fix  a  certain  percentage  of  these  letters  to  be 
either  true  or  false.  Then  we  test  whether  the  overall  system  of 
implications  and  settings  are  consistent.  However,  to  increase  problem 
difficulty  we  require  that  at  least  one  literal  per  implication  be  free 
(i.e.  not  fixed).  In  some  tests  we  increase  the  number  of  free 
variables  per  implication  to  three. 

Results 

Satisfiability  Experiment.  The  most  significant  result  of  our 
satisfiability  experiments  is  the  ease  of  their  solution  as  mixed^integer 
programs.  Tables  17  and  18  summarize  our  experiences  using  the  BANDBX 
and  Land-Powell 's  code,  respectively.  The  actual  problem  sizes  ranges 
from  114  constraints,  110  variables,  38  binary  for  the  nonsharp  modelling 
of  Table  18,  to  315  constraints,  132  variables  43  of  which  are  binary  in 
the  ninth  problem  of  Table  17.  Problems  sizes  vary  with  the  total  number 
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of  literal  negations  present,  but  for  an  approximate  size  use  the 
following  formulas. 

Constraints  Variables  Binary 

Sharp  4C  ♦  L  2L  +  C  L 

NonSharp  2C  ■*’  L  2L  C  L 

Aside  from  being  extremely  easy,  even  after  our  attempts  to 
"harden"  the  problems,  notice  the  sharp  modellings  are  slower  than  the 
nonshorp  modellings  in  nearly  all  instances.  This  is  to  be  expected  when 
the  sharp  formulation  in  this  case  is  not  better  than  the  nonsharp 
formulation  and  yet  has  more  constraints  (recall  that  problems  of  this 
type  are  simply  generalized  set  covering  problems).  The  sharp  modelling 
appears  to  perform  worse  than  the  nonsharp  modelling  as  the  number  of 
literals  per  clause  increases  (Table  17).  On  the  other  hand,  an  increase 
in  the  number  of  clauses  appears  to  affect  the  nonsharp  modelling 
slightly  more  than  the  sharp  (Table  18). 
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Table  17.  Satisfiability  Tests  Using  BANDBX. 


1 

Time  (CPU 

.  Seconds ) 

Problem 

1  Total  Nodes  | 

1  Sharp  1 

Non 

Sharp 

Size 

Satisf iable? 

Sharp 

Non  Sharp 

LP 

Total 

LP 

Total 

L*31,C»44,S-2 

No 

2 

2 

9.9 

11.8 

4.0 

6.5 

L»35,C-45,S-2 

No 

3 

3 

12.8 

21.5 

4.7 

9.8 

L-37,C»45,S»2 

Yes 

1 

4 

8.7 

8.8 

4.7 

10.8 

L«36,C-52,S-2 

No 

3 

3 

16.0 

27.0 

6.0 

14.4 

L*46,C»63,S-2 

No 

2 

3 

25.7 

29.8 

8.7 

21.3 

L*53,C-68,S-2 

Yes 

Too 

Large 

3 

model 

too  large 

10.1 

21.5 

L*36,C-39,S«3 

Yes 

3 

1 

10.1 

13.4 

3.9 

4.1 

L»38,C-45,S-3 

Yes 

3 

2 

18.1 

29.6 

5.5 

6.5 

L-43,C-45,S»4 

Yes 

2 

1 

14.6 

16.6 

5.2 

5.4 

L-40,C«45,S«4 

Yes 

2 

2 

34.4 

36.6 

5.3 

7.0 

L-25,C-35,S»5 

I 

1  Yes 

1  1 

1 

173.4 

173.5 

1 

3.6 

Table  18.  Satisfiability  Tests  using  Land-Powell 's  Code. 


Problem 

Size 

Sat isf iable? 

Time  (CPU 

Seconds ) 

Toti 

*1  Nodes 

Sharp 

Non  Sharp 

Sharp 

Non  Sharp 

LP 

Total 

LP 

Total 

L*38,C«A0,S«2 

Yes 

2 

1 

8.4 

9.7 

3.4 

3.5 

L-42,C-40,S«2 

Yes 

1 

1 

7.1 

7.3 

3.6 

3.7 

L»44,C»45.S-2 

Yes 

2 

2 

9.7 

12.6 

4.8 

6.5 

L«45,C-45,S»2 

Yes 

5 

9.1 

11.0 

4.7 

13.0 

L»44,C-52,S-2 

Yes 

1 

1 

12.5 

12.6 

5.5 

5.6 

L-43,C«55,S-3 

No 

2 

3 

14.7 

17.1 

6.3 

13.7 

L»45,C*60,S»3 

No  1 

2 

3 

19.3 

22.0 

7.6 

13.4 

L-45,C»60,S-4 

No 

1 

2 

3 

1 

17.7 

23.9 

7.3 

15.5 

J51 


Implications  Testing  Experiment.  This  experiment  is  performed 
entirely  using  CDC's  APEX  IV  mixed-integer  programming  code,  and  as  in 
the  Satisfiability  experiments,  the  branch-and-bound  code,  with  few 
exceptions,  solves  these  problems  easily.  The  implications  studied  here 
are  of  the  generalized  set-covering  type.  In  Tables  19  -  24  we  do  not 
apply  our  rule  of  having  at  least  one  free  literal  per  implication,  and 
the  results  show  these  random  problems  to  be  very  easy.  Recall  that  each 
implication  requires  only  one  generalized  set  covering  constraint. 
Therefore  the  number  of  constraints  in  these  problems  equals  the  number 
of  implications  plus  one  constraint  for  each  literal  fixed  at  either  true 
(*1)  or  false  (*0). 

Since  we  do  not  keep  track  of  how  many  letters  in  a  given  clause 
are  free  (not  fixed),  it  is  possible  that  all  letters  (or  all  but  one)  in 
many  clauses  are  fixed.  Such  problems  tend  to  be  very  easy  for  humans  to 
solve  by  a  scanning  procedure  similar  to  our  "pruning."  Therefore  in  the 
next  experiment  we  insure  that  the  letters  left  free  provided  instances 
of  NP-hard  problems. 

Table  20  summarizes  problems  in  which  there  are  five  literals  in 
each  K.  and  2  in  each  L.,  an  example  is  (A1  A  ~A3  A  A4  A  A5)  ♦  (A2  V  A3). 

1  I 

Each  implication  must  have  at  least  three  literals  not  assigned  an 
initial  value.  This  requirement  greatly  restricts  the  literals  which  we 
are  able  to  assign  as  true  or  false.  In  the  seventh  example  we  are  able 
to  initialize  only  eleven  of  the  fifty  literals.  In  the  "Total  Fixed" 
column  we  indicate  that  each  problem  instance  is  solved  twice.  First  the 


Table  19.  Implications  Testing  Using  APEX  IV 
No  Free  Variables  Required. 


Problem  Parameters 

rals/ 

se 

L 

Total 

Fixed 

Consistent 

? 

Nodes 

Total 

Time 

APEX  Units 

LP  Total 

Total  1 
Implications 

P  of 

Literals 

Lite 

Clau 

K 

300 

295 

3 

1 

60 

Yes 

1 

7.8 

8.2 

300 

250 

3 

2 

25 

Yes 

1 

2.2 

2.6 

100 

100 

3 

1 

10 

Yes 

i 

0.9 

1.2 

300  1 

200 

3 

1 

20 

Yes 

1 

2.2 

2.7 

300 

294 

3 

1 

30 

Yes 

1 

2.2 

2.7 

200 

198 

3 

1 

20 

Yes 

1 

1.7 

2.2 

300 

299 

3 

1 

90 

Yes 

1 

10.7 

10.9 

300 

300 

3 

1 

120 

Yes 

1 

14.2 

14.6 

300 

293 

3 

1 

2 

Yes 

1 

2.5 

2.9 

300 

293 

3 

1 

150 

No 

1 

INFEAS 

4.0 

300 

293 

3 

1 

120 

Yes 

1 

3.7 

4.0 

300 

298 

3 

1 

120 

Yes 

1 

3.0 

3.3 

300 

292 

3 

1 

150 

No 

1 

INFEAS 

3.3 

300 

298 

3 

1 

150 

No 

1 

INFEAS 

4.6 

300 

298 

3 

1 

1 

150 

No 

1 

INFEAS 

17.7 

I . .  -  ..  > 


..V 


■j» 
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Table  20.  Implication  Tests  Using  APEX  IV, 
3  Free  Literals  per  Implication 


Prob] 

lem  Parameters 

Total 

Fixed 

Consistent 

? 

Nodes 

Total 

Time 

APEX  Units 
LP  Total 

Total  i 
Implications 

f  of 

Literals 

Literals/ 

Clause 

K  L 

300 

160 

5  2 

58  to  !*♦ 

Yes 

1 

1.4  1.8 

58  to  0++ 

Yes 

1 

6.0  6.4 

300 

120 

5  2 

38  to  1 

Yes 

1 

1.0  1.4 

38  to  0 

Yes 

2 

5.1  5.9 

300 

100 

5  2 

34  to  1 

Yes 

1 

1.3  1.7 

34  to  0 

Yes 

1 

4.2  4.6 

400 

100 

5  2 

29  to  1 

Yes 

1 

0.9  1.3 

29  to  0 

Yes 

1 

4.4  4.8 

400 

60 

5  2 

13  to  1 

Yes 

1 

1.1  1.4 

13  to  0 

Yes 

1 

2.6  2.9 

500 

60 

5  2 

14  to  1 

Yes 

2 

1.8  2.9 

14  to  0 

Yes 

3 

32.5  47.5 

500 

50 

5  2 

ll  to  1 

Yes 

1 

1.2  1.6 

H  to  0 

Yes 

2 

3.6  4.7 

600 

60 

5  2 

14  to  1 

Yes 

5 

4.2  35.3 

14  to  0 

Yes 

1 

25.3  25.7 

♦The  APEX  IV  code  was  shut  off  before  reaching  a  solution. 

♦♦This  notation  "15  to  1"  means  15  literals  are  given  as  true  (*1) 
and  one  is  given  as  false  (*0). 

++Thi6  notation  "15  to  0"  means  the  same  15  literals  which  were 
previously  given  as  true,  one  given  as  false  (*0).  Otherwise 
the  problems  within  each  set  of  lines  are  identical.  In  this 
case,  there  are  actually  59  literals  given  as  false  (*0). 


Table  21.  Implications  Tests,  Affects  of  Fixing  Literals 
1  Free  Literal  per  Implication. 


Problem  Parameters 


Literals/ 
Total  #  of  Clause 

Implications  Literals  K  L 


Consistent  Nodes |  APEX  Units 
?  Total 


INFEAS 

11.3 

INFEAS 

10.9 

INFEAS 

11.8 

INFEAS 

16.5 

INFEAS 

16.4 

INFEAS 

17.4 

INFEAS 

24.9 

INFEAS 

28.1 

INFEAS 

21.6 

INFEAS 

22.5 

INFEAS 

32.5 

INFEAS 

31.5 

INFEAS 

45.0 

INFEAS 

40.9 

INFEAS 

71.0 

INFEAS 

62.4 

INFEAS 

84.1 

e 


Yes 


13 


^  / 
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problem  is  solved  with  the  literals  assigned  to  be  true,  then  the 
identical  problem  is  solved  with  the  same  literals  assigned  tc  be  false 
(*0).  Since  our  random  generator  provides  true  and  negation  instances  cl 
each  literal  with  equal  probability,  whether  a  literal  is  assifed  a  i  r  _e 
or  false  value  should  not  create  more  difficult  problems .  Hcwever,  we 
find  that  when  literals  are  assigned  false  (*0)  values,  fe  ;r  . 
more  difficult  to  solve  using  the  APEX  IV  code.  Despite  t^;* 
unexplainable  phenomenon,  the  most  prominent  feature  of  tnese  re 
their  ease  of  solution.  Again,  what  is  easy  or  hard  let  *  -j  ^  ’ 

easy  for  machine  solution. 

In  table  21  we  solve  only  one  problem  instance,  but  vat.  t-f  r.ta 
number  of  literals  assigned  a  true  value.  Each  implication  ccnists  :• 
only  two  literals  in  each  clause,  and  each  Lj  is  a  singleton  clause. 
The  problem  becomes  more  difficult  as  it  approaches  the  feasible' 
infeasible  boundaries.  In  fact,  we  believe  that  some  of  the  hardest 
problems  are  at  this  boundary,  and  in  these  tests  we  are  trying  tc  create 
a  hard  problem.  Only  the  last  three  instances  found  the  LP  relaxation 
feasible,  and  these  three  instances  are  much  harder  than  the  more 
infeasible  instances.  The  last  entry  actually  found  a  feasible  solution, 
and  its  solution  time  is  much  faster  than  the  infeasible  example  just 
above  it. 

Using  ideas  from  our  experiment  in  connection  with  Table  21, 

Tables  22  and  23  depict  problems  with  fewer  literals  initially  fixed. 
These  problems  are  difficult,  one  instance  requires  over  2000  APEX  units 
of  time.  (Recall,  APEX  units  are  virtually  equal  to  epu  seconds.) 
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Table  22.  Implication  Tests  Using  APEX  IV, 
1  Free  Literal  per  Implication. 


Problem  Parameters 


Total  i 
Impl icat ions 

i*  of 

Literals 

Literals/ 

Clause 

K  L 

Total 

Fixed 

Consistent 

? 

Nodes 

Total 

Time 

APEX  Units 

LP  Total 

400 

100 

2 

1 

15 

to 

1** 

No 

3 

46.4 

71.8 

400 

100 

2 

1 

15 

to 

1 

No 

1 

INFEAS 

63.2 

15 

to 

0++ 

No 

7 

135.2 

193.9 

400 

100 

2 

1 

15 

to 

1 

No 

21 

77.7 

237.6 

15 

to 

0 

Yes 

3 

139.8 

173.4 

400 

100 

2 

1 

10 

to 

1 

No 

1 

INFEAS 

100.0 

10 

to 

0 

No 

1 

INFEAS 

147.7 

400 

100 

2 

1 

10 

to 

1 

No 

1 

INFEAS 

84.7 

10 

to 

0 

No 

243 

109.3 

2083.5 

400 

100 

2 

1 

5 

to 

1 

Unknown 

201* 

74.4 

557* 

400 

100 

2 

1 

5 

to 

1 

Unknown 

481* 

66.9 

634* 

400 

100 

2 

1 

1 

to 

1 

Unknown 

408* 

44.0 

716* 

*The  APEX  IV  code  was  shut  off  before  reaching  a  solution. 
**(see  Table  20) 


++(see  Table  20) 
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Table  23.  Implications  Tests  Using  APEX  IV 
1  Free  Literal  per  Implication. 


Problem  Parameters 


Total 

Implications 

#  of 
Literals 

Literals/ 

Clause 

K  L 

Total 

Fixed 

Consistent 

7 

Nodes 

Total 

Time 

APEX  Units 

LP  Total 

300 

100 

2 

1 

1  to  1 

Yes 

67 

27.9 

26]  .9 

1  to  0 

Yes 

8 

21.2 

53.0 

300 

100 

2 

1 

1  to  1 

Yes 

32 

25.7 

184.2 

1  to  0 

Yes 

13 

47.6 

131.0 

300 

100 

2 

1 

1  to  1 

Yes 

14 

11.0 

59.6 

1  to  0 

Yes 

16 

13.8 

92.9 

300 

100 

2 

1 

15  to  1 

Yes 

12 

15.9 

51.3 

15  to  0 

No 

3 

68.6 

81 .8 

300 

100 

2 

1 

15  to  1 

Yes 

21 

15.0 

107.6 

15  to  0 

No 

1 

INFEAS 

69.4 

300 

100 

2 

1 

15  to  1 

Yes 

S 

19.1 

61.3 

15  to  0 

Yes 

25 

59.2 

161.7 

300 

100 

2 

1 

15  to  1 

No 

13 

35.7 

92.5 

15  to  0 

No 

83 

64.6 

431.0 

300 

100 

2 

1 

15  to  1 

Yes 

7 

38.6 

80.1 

15  to  0 

Yes 

49 

44.4 

262.0 

300 

100 

2 

1 

15  to  1 

Yes 

19 

17.8 

82.4 

15  to  0 

Yes 

6 

39.0 

57.4 

300 

100 

2 

1 

15  to  1 

No 

2 

29.7 

35.6 

15  to  0 

Yes 

5 

50.4 

69.4 

300 

100 

2 

1 

15  to  1 

No 

36 

27.5 

192.5 

15  to  0 

Yes 

88 

93.8 

570.9 

In  this  manner,  we  succeed  in  producing  problems  which  are  hard  to 
solve,  apparently  by  being  at  the  feasible/infeasible  boundary.  What 
seems  hardest  for  the  machine  is  when  the  LP  is  feasible,  but  barely, 
while  the  IP  is  infeasible.  However,  since  these  problems  insure  only 
one  free  letter  per  rule,  they  may  have  been  easier  for  humans. 

Again  making  literals  false  produces  much  more  difficult  problems, 
although  in  many  cases  these  problems  require  much  fewer  branch-and-bound 
nodes.  As  a  first  attempt  to  explain  why  setting  literals  to  false 
results  in  more  difficult  problems,  we  alter  our  problem  generator  in 
favor  of  generating  true  literals.  In  Table  24  the  probability  of  a 
literal  appearing  as  a  negation  is  reduced  to  0.3.  For  these  problems  we 
expect  setting  literals  to  be  true  to  be  even  faster  than  before. 
Unfortunately  that  does  not  appear  to  be  the  case  as  2  of  the  6  problems 
actually  solve  faster  when  literals  are  forced  to  be  false. 

Table  25,  26,  and  27  summarize  the  results  of  implications  tests 
in  which  disjunctions  occur  in  the  clause  preceding  the  implication. 

(i.e.,  for  L^,  both  and  are  disjunctions  of  literals.)  This 

test  and  corresponding  integer  model  is  outlined  by  formulae  (5-15)  to 
(5-17).  We  shall  name  model  (5-16)  as  the  STANDARD  model,  and  model 
(5-17)  as  the  SHARP  model.  For  these  tests  we  alter  the  problem 
generator  such  that  each  occurrence  of  a  particular  letter  is  exactly  the 
opposite  (True  or  False)  of  its  previous  occurrence.  Thus,  these 
problems  instances  are  constructed  to  be  difficult  problems. 

As  (5”16)  shows,  the  STANDARD  model  requires  three  constraints  and 
one  additional  variable  per  implication.  From  (5-17),  we  see  that  the 


Table  26.  Implications  with  "ors"  on  LHS  and  RHS, 

1  Free  Literal  per  Implication,  Variables 


♦♦Notice:  after  1  hour  of  runnioR  time  this  problem  remains 
unsolved  using  the  STANDARD  model. 


SHARP  model  requires  (S^l)  constraints  and  an  additional  variable  per 
implication,  where  S  is  defined  as  the  number  of  letters  appearing  in  the 
disjunction.  Based  upon  well-known  results  concerning  constraint 
disaggregation,  (see  (19),  131],  [47]),  and  our  own  previous  experiments, 
we  expect  the  SHARP  formulation  to  outperform  the  STANDARD  modelling 
despite  its  larger  size.  Our  results  support  our  expectation. 

Tables  25  and  26  depict  identical  implication  problems  except  in 
Table  25  the  known  letters  are  given  to  be  true  (set  to  1)  whereas  in 
Table  26,  the  same  letters  are  given  as  false  (set  to  0).  The  size  of 
the  implication  problems  of  these  tables  result  in  modellings  consisting 
of  between  300  to  700  constraints  and  approximately  270-350  binary 
variables.  Some  problems  involve  a  singleton  on  the  right  of  the 
impliction  (L^is  a  singleton)  and  others  have  a  disjunction  of  two 
letters.  Notice  that  in  both  (5—16)  and  (5-17)  only  one  constraint  is 
affected  by  adding  disjunctions  on  the  right  of  an  implication. 

For  all  problems  in  Table  25,  the  SHARP  modelling  overcomes  its 
larger  size  and  solves  each  instance  faster  and  with  fewer  nodes.  In  one 
instance,  (the  third  sample),  the  SHARP  model  finds  the  LP  infeasible 
while  the  lack  of  sharpness  allows  the  STANDARD  to  solve  the  LP  and 
branch  to  11  nodes  before  reaching  the  same  conclusion.  The  node  counts 
indicate  at  least  a  4  to  1  reduction  in  favor  of  the  SHARP  model.  For 
these  samples,  the  SHARP  LP  is  solved  faster  than  the  smaller,  however 
more  dense,  STANDARD  LP  in  4  of  the  9  instances. 

Table  26  again  indicates  that  setting  the  given  variables  to  zero 
seems  more  difficult  to  APEX  IV  than  setting  the  same  variables  to  one. 
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Aside  from  that,  the  SHARP  modelling  is  again  faster  and  requires  at 
most  one-third  the  number  of  Branch-and-Bound  nodes.  For  harder  problems 
(see  instances  5,  6,  9)  the  SHARP  model  displays  at  least  a  15  to  1 
advantage  concerning  node  counts  and  in  problem  6,  the  node  comparison  is 
at  least  40  to  1.  The  actual  time  comparisons  are  unavailable  as  the 
STANDARD  model  fails  to  solve  the  instances  after  5  minutes  of  cpu  time, 

and  still  cannot  solve  problem  6  after  one  hour  of  cpu  time  (APEX 
units) ! 

In  two  inconsistent  problems,  the  inconsistency  is  discovered  with 
the  SHARP  LP,  whereas  the  STANDARD  modelling  requires  up  to  82  branch- 
and-bound  nodes  before  declaring  one  of  them  infeasible. 

Table  27  summarizes  problems  with  disjunctions  of  five  letters  on 
the  LHS  of  each  implication.  Therefore  each  implication  requires  6 
constraints  in  the  SHARP  (5-17)  modelling,  and  only  three  for  the 
STANDARD  (5-16)  modelling.  Even  though  the  SHARP  models  now  have  twice 
the  constraints,  and  their  corresponding  LP  relaxations  take  longer  to 
solve,  they  easily  outperform  the  STANDARD  formulation.  The  node  count 
comparison  reaches  70  to  1  in  one  instance  and  the  total  enumeration  time 
advantage  of  the  SHARP  modelling  increases  as  the  problems  become  more 
difficult. 
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constraint  disaggregation.  However  our  model  comes  automatically  whereas 
previous  disaggregations  involve  problem  preprocessing,  (see  (31]) 

We  are  able  to  generate  more  difficult  problems  by  exploring  the 
feasible/infeasible  boundaries.  Whether  these  intentional  efforts  to 
create  difficult  problems  is  more  or  less  realistic  requires  empirical 
work  with  actual  expert  systems.  The  results  clearly  indicate  that  these 
logic  problems  are  very  efficiently  solved  as  math  programs,  especially 
considering  that  the  mixed~integer  branch-and-bound  codes  employed  are 
not  specialized  binary  variables  codes,  but  general  mixed-integer  codes. 
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